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ACT" 1 
Field Programmable 


Gate Arrays 





Features 


e High Gate Count 


— A1010/A1010A: 1200 gate array gates 
(3000 PLD/LCATM equivalent gates) 


— A1020/A1020A: 2000 gate array gates 
(6000 PLD/LCA equivalent gates) 


Instant Prototypes and Production 
e Low-Power CMOS Technology 


e Speed-Graded Devices Provide 15% to 25% Performance 
Improvement 


e Toggle Rates to 100 MHz 
VO Drive to 8 mA 
System-Level Performance to 40 MHz 


Gate Array Architecture Allows Completely Automatic Place 
and Route 


Nonvolatile, Permanent Programming 
e Built-In Clock Distribution Network 
e Built-In Diagnostic Probe Pins 


Description 


The ACT 1 family of field programmable gate arrays (FPGAs) 
offers a variety of package, speed, and application combinations. 
Devices are implemented in silicon gate,1.2-micron or 2-micron 
two-level metal CMOS, and they employ Actel’s PLICE™ antifuse 


technology. The unique architecture offers gate array flexibility, 
high performance, and instant turnaround through user 
programming. Device utilization is typically 95% of available logic 
modules. 


ACT 1 devices also provide system designers with unique on-chip 
diagnostic probe capabilities, allowing convenient testing and 
debugging. Additional features include an on-chip clock driver with 
a hardwired distribution network. The network provides efficient 
clock distribution with minimum skew. 


The user-definable I/Os are capable of driving at both TTL and 
CMOS drive levels. Available packages include plastic and ceramic 
J-leaded chip carriers, ceramic quad flatpack, and ceramic pin grid 
array. 


A security fuse may be programmed to disable all further 
programming and to protect the design from being copied or 
reverse engineered. 


The Action Logic System 


The ACT 1 device family is supported by Actel’s Action Logic'M 
System, allowing logic design implementation with minimum 
effort. The Action Logic System (ALS) interfaces with the resident 
CAE system to provide a complete gate array design environment: 
schematic capture, simulation, fully automatic place and route, 
timing verification, and device programming. The Action Logic 
System is available for 386™ PC and for Apollo™ and Sun™ 
workstations, running Viewlogic®, Mentor Graphics®, Valid™, 
and OrCADIM, 





Product Family Profile 


Device CMOS Process Applications Speed Grades Packages 
A1010, A1020 2.0 um C.I, M, B Standard JQCC, COFP CPGA 
A1010A, A1020A 1.2 um C, I, M, B Standard, -1, -2 PLCC, JQCC, CQFP, CPGA 
Important: 
Contact your Actel representative for current availability. See Product 
Plan for package, speed grade, and application combinations. 
© 1691 Actel Corporation March 1991 1-1 
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Figure 1. Partial View of an ACT 1 Device 





ACT 1 Device Structure 


A partial view of an ACT 1 device (Figure 1 above) depicts four 
logic modules and distributed horizontal and vertical interconnect 
tracks. PLICE antifuses, located at intersections of the horizontal 
and vertical tracks, connect logic module inputs and outputs. 
During programming, these antifuses are addressed and 
programmed to make the connections required by the circuit 
application. 


The Actel Logic Module 


The Actel logic module is an eight-input, one-output logic circuit 
chosen for the wide range of functions it implements and for its 
efficient use of interconnect routing resources. 


The logic module can implement the four basic logic functions 
(NAND, AND, OR, and NOR) in gates of two, three, or four 
inputs. Each function may have many versions, with different 
combinations of active-low inputs. The logic module can also 
implement a variety of D-latches, exclusivity function, AND-ORs, 
and OR-ANDs. No dedicated hardwired latches or flip-flops are 
required in the array since latches and flip-flops may be constructed 
from logic modules wherever needed in the application. 


1/0 Buffers 


Each ЏО pin is available as an input, output, three-state, ог 
bidirectional buffer. Input and output levels are compatible with 
standard TTL and CMOS specifications. Outputs sink or source 
4mA at TTL levels. See Electrical Specifications for additional I/O 
buffer specifications. 





Device Organization 


ACT 1 devices consist of a matrix of logic modules arranged in rows 
separated by wiring channels. This array is surrounded by a ring of 
peripheral circuits including I/O buffers, testability circuits, and 
diagnostic probe circuits providing real-time diagnostic capability. 
Between rows of logic modules are routing channels containing 
sets of segmented metal tracks with PLICE antifuses. Each channel 
has 22 signal tracks. Vertical routing is permitted via 13 vertical 
tracks per logic module column. The resulting network allows 
arbitrary and flexible interconnections between logic modules and 
ИО modules. 


Probe Pin 


ACT 1 devices have two independent diagnostic probe pins. These 
pins allow the user to observe any two internal signals by entering 
the appropriate net name in the diagnostic software. Signals may 
be viewed on a logic analyzer using Actel’s Actionprobe™ 
diagnostic tools. The probe pins can also be used as user-defined 
I/Os when debugging is finished. 


ACT 1 Array Performance 


Temperature and Voltage Effects 


Worst-case delays for ACT 1 arrays are calculated in the same 
manner as for masked array products. A typical delay parameter is 
multiplied by a derating factor to account for temperature, voltage, 
and processing effects. However, in an ACT 1 array, temperature 
and voltage effects are less dramatic than with masked devices. 


ACT 1 FPGAs 


The electrical characteristics of module interconnections on 
ACT 1 devices remain constant over voltage and temperature 
fluctuations. 


As a result, the total derating factor from typical to worst case for a 
standard speed ACT 1 array is only 1.19 to 1, compared to 2 to 1 for 
a masked gate array. 


Logic Module Size 


Logic module size also affects performance. A mask programmed 
gate array cell with four transistors usually implements only one 
logic level. In the more complex logic module (similar to the 
complexity of a gate array macro) of an ACT 1 array, 
implementation of multiple logic levels within a single module is 
possible. This eliminates interlevel wiring and associated RC 
delays. The effect is termed “net compression.” 





Ordering Information 


A1020 A - 1 PL 84 c 
| 0 Application 
Package Lead Count 
Package Type 
Speed Grade 
Product Revision 
Part Number 








Product Plan 
A1010 and A1020 Devices 





J-Leaded Cerquad Chip Carrier (JQ) 
44-pin 
68-pin 
84-pin 


Speed Grade 


Application 





Ceramic Quad Flatpack (CQ) 





84-pin 


Ceramic Pin Grid Array (PG) 





84-pin 
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Important: Before ordering, contact your Actel representative for current availability. 


Applications: C = Commercial Availability: = Available Now 
I = Industrial P = Planned 
M = Military — = Not Planned 
B = 883B 
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Product Plan (continued) 
A1010A and A1020A Devices 





























Speed Grade Application 
Std -1 -2 с I M B 
Plastic J-leaded Chip Carrler (PL) 
44-pin Lal Га . Гай ай = = 
68-pin Ld v Гай d v = = 
84-ріп Гай Гай Гай Гай Гай = = 
J-leaded Cerquad Chip Carrier (JQ) 
44-pin = P P - Р Р Р 
68-pin = P P _ P P P 
84-pin _ P P _ P P P 
Ceramic Quad Flatpack (CQ) 
84-pin _ P P = P P P 
Ceramic Pin Grid Array (PG) 
84-pin - P P _ P P P 
Important: Before ordering, contact your Actel representative for current availability. 
Applications: C = Commercial Availability: p^ = Available Now 
I = Industrial P - Planned 
M - Military — = Not Planned 
B = 883B 
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Device Resources 
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Series Modules Gates |44PL,JQ 68 PL, JO 84PL JQ 8400 6 





A1010 295 1200 34 57 N/A NA 57 
A1020 547 2000 34 57 69 69 69 


Absolute Maximum Ratings 
Free air temperature range 


Symbol 
Усс DOC Supply Voltage! 
Vi Input Voltage 
Vo Output Voltage 


Parameter Limits Units 

-0.5 to 47.0 Volts 
-0.5 to 66 +05 = Volts 
-0.5 to Vcc +0.5 Volts 


l< Input Clamp Current +20 mA 
lox Output Clamp Current +20 mA 
lox Continuous Output Current +25 mA 


Tsta Storage Temperature -65 to + 150 °C 


Stresses beyond those listed under “Absolute Maximum Ratings” may 
cause permanent damage to the device. Exposure to absolute maximum 
rated conditions for extended periods may affect device reliability. Device 
should not be operated outside the Recommended Operating Conditions. 


Note: 
1. Vpp = Vcc, except during device programming. 


Recommended Operating Conditions 


Parameter Commercial Industrial Military Units 





Temperature 





Range (Note 1) Oto +70 -40 to +85 -55to +125 °C 
Powar Supply +5 +10 +10 № 
Note: 


1. Ambient temperature (TA) used for commercial and industrial; case 
temperature (Tc) used for military. 


Power Dissipation 


The following formula is used to calculate total device dissipation. 


Total Device Power (mW) = (0.20 x Nx F1) + (0.085 x Mx F2) + 
(0.80 x P x ЕЗ) 


Where: 
F1 = Average logic module switching rate in MHz. 
F2 = CLKBUF macro switching rate in MHz. 
F3 = Average I/O module switching rate in MHz. 


М = Number of logic modules connected to the CLKBUF 
macro. 


N = Total number of logic modules used in the design 
(including M). 


P= Number of outputs loaded with 50 pF 


Average switching rate of logic modules and of I/O modules is 
some fraction of the device operating frequency (usually 
CLKBUF). Logic modules and I/O modules switch states (from 
low-to-high or from high-to-low) only if the input data changes 
when the module is enabled. A conservative estimate for average 
logic module and I/O module switching rates (variables F1 and F3, 
respectively) is 10% of device clock driver frequency. 


If the CLKBUF macro is not used in the design, eliminate the 
second term (including F2 and M variables) from the formula. 


Sample A1020 Device Power Calculation 


To illustrate the power calculation, consider a large design 
operating at high frequency. This sample design utilizes 85% of 
available logic modules оп the A1020-series device (.85 א‎ 547 = 465 
logic modules used). The design contains 104 flip-flops (208 logic 
modules). Operating frequency of the design is 16 MHz. In this 
design, the CLKBUF macro drives the clock network. Logic 
modules and I/O modules are switching states at approximately 
10% of the clock frequency rate (.10x 16 MHz = 1.6 MHz). Sixteen 
outputs are loaded with 50 pF. 


To summarize the design described above: N = 464;M = 208; 
F2 = 16; Fl = 4; F3 = 4; P = 16. Total device power can be 
calculated by substituting these values for variables in the device 
dissipation formula. 


Total device power for this example = 


(0.20х 465 x 1.6) + (0.085 x 208 x 16) + (0.80x 16x 1.6) = 452 mW 








Electrical Specifications 







Parameter 





(он = -4 mA) 3.84 V 
Vou! 
₪ Чон = -3.2 mA) 37 37 у 
Vor! ) = 4 mA) 0.33 0.40 0.40 v 
V. -0.3 0.8 -0.3 0.8 -0.3 0.8 V 
Input Transition Time ta, te? 500 500 500 ns 
Со WO Capacitance? 3 10 10 10 pF 
Standby Current, Icc* 10 20 25 mA 
Leakage Current® -10 10 -10 10 -10 10 НА 
los Output Short (Vo = Vco) 20 140 20 140 20 140 mA 
Circuit Current® (yo = GND) -10 -100 -10 -100 -10 -100 mA 
Notes: 
1. Only one output tested at a time. Vcc = min. 
2. Not tested, for information only. 
3. Includes worst-case 84-pin PLCC package capacitance. Vout = 0 V, = 1 MHz. 
4. Typical standby current = 3 mA. All outputs unloaded. All inputs = Vcc or GND. 
5. Vo, VIN = Vcc or GND. 
6. Only one output tested at a time. Min. at Vcc = 4.5 V; Max, at Vcc = 5.5 V. 


Package Thermal Characteristics 


The device junction to case thermal characteristic is 0jc, and the Maximum junction temperature is 150°C. 


junction to ambient air characteristic is Өја. Тһе thermal A sample calculation of the maximum power dissipation for an 
characteristics for Өја are shown with two different air flow rates. 84-pin ceramic pin grid array at military temperature is as follows: 


Max. junction temp. (°C) - Max. military temp. (°C) 150°C - 125°C 











= = 0.76W 
6ja (°C/W) 33°C/W) 
Package Type Pin Count Өје s. o ik Units 
Plastic J-leaded Chip Carrier 44 15 52 40 °C/W 
68 13 45 35 см 
84 12 44 33 °C/W 
Cerquad J-leaded Chip Carrier 44 8 38 30 °C/W 
68 8 35 25 °C/W 
84 8 34 24 °C/W 
a I e... וו‎ 
Ceramic Quad Flatpack 84 5 40 30 °CW 


cA ee ae‏ יו וי 
Ceramic Pin Grid Array 84‏ 
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Functional Timing Tests AC Test Specifications 

| Усс = 5.0 V; Ta = 25°С 
AC timing for logic module internal delays is determined after —— E 
place and route. The ALS Timer utility displays actual timing Parameter Min. Max. Units 
parameters for circuit delays. ACT 1 devices are AC tested to a : 
“binning” circuit specification. Binning Circuit Delay (teoa) 





A1010 87 ns 
The circuit consists of one input buffer + n logic modules + one A1010A 
output buffer (n = 16 for A1010/A1010A; n = 28 for A1020/A1020A). 
The logic modules are distributed along two sides of the device, as Standard Speed 87 ns 
inverting or non-inverting buffers. The modules are connected -1 Speed Grade 72 ns 
through programmed antifuses with typical capacitive loading. -2 Speed Grade 65 nè 
Propagation delay [tpp = (teun + teuL)2] is tested to the following 
AC test specifications. A1020 135 ns 
A1020A 
Standard Speed 135 ns 
-1 Speed Grade 112 ns 
-2 Speed Grade 101 ns 








Output Buffer Performance Derating 





Їон (MA) 





===. Military, worst case values at 125°C, 4.5 V. 
— Commercial, worst case values at 70°C, 4.75 V. 





Note: 


The above curves are based on characterizations of sample devices and are 
not completely tested on all devices. 
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Timing Characteristics and critical (speed-sensitive) nets are given below. Most nets will 
fall into the “typical” category. 

Timing is design-dependent; actual delay values are determined 

after place and route of the design using the ALS Timer utility. The 

following delay values use statistical estimates for wiring delays 

based on 85% to 90% module utilization. Device utilization above 

95% will result in performance degradation. 


Less than 1% of all routing in a design requires the use of “long 
tracks.” Long tracks, long vertical or horizontal routing paths, are 
used by the autorouter only as needed. Delays due to the use of long 
tracks range from 15 ns to 35 ns. Long tracks may be used to route 
the least-critical nets in a given design. 

With ALS place and route programs, the user can assign criticality 

level to a net, based on timing requirements. Delays for both typical 


Logic Module Timing 
Мос = 5.0 V; Ta = 25°C; Process = Typical; סקז‎ = 3.0 ns ₪ FO = 0 





Single Logic Module Macros 
(e.g., most gates, latches, multiplexors)' 








Parameter Output Net FO=1 ҒО = 2 РО = 3 РО = 4 РО = 8 Unite 
tep Critical 5.4 5.8 6.2 85 Note 2 ns 
tep Typical 6.3 6.7 77 8.6 10.8 ns 





Dual Logic Module Macros 
(e.g., adders, wide Input gates)! 








Parameter Output Net РО = 1 FO = 2 FO = З РО = 4 ҒО = 8 Units 
tep Critical 9.2 9.6 10.0 12.3 Note 2 ns 
teo Typical 10.2 10.6 11.6 12.5 14.6 ns 





Sequentlal Element Timing Characteristics 











Parameter 
tsu Set Up Time, Data Latches 3.5 3.9 4.2 4.5 4.8 ns 
tsu Set Up Time, Flip-Flops 3.9 3.9 3.9 3.9 3.9 ns 
ty Hold Time 0 0 0 0 0 ns 
tw Pulse Width, Minimum? 7.7 8.5 9.2 10.0 14.0 ns 
tco Delay, Critical Net 54 5.8 6.2 8.5 Note 2 ns 
tco Delay, Typical Net 6.3 67 7.7 86 10.8 ns 
Notes: 


1. Most flip-flops exhibit single module delays. 
2. Critical nets have a maximum fan-out 01 6. 
3. Minimum pulse width, tw, applies to CLK, PRE, and CLR inputs. 
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VO Buffer Timing 
Voc = 5.0 V; Ta = 25°C; Process = Typical 


INBUF Macros 











Parameter From - To FO=1 FO=2 FO = 3 РО = 4 РО = 8 Units 
‚ {єн Pad to Y 6.9 76 8.9 10.7 14.3 ns 
tein Pad to Y 59 6.5 77 8.4 12.4 ns 





CLKBUF (High Fan-Out Clock Buffer) Macros 











Parameter FO = 40 FO = 160 FO = 320 Units 
н. 9.0 12.0 15.0 ns 
tPLH 9.0 12.0 15.0 ns 
Notes: 
1. A clock balancing feature is provided to minimize clock skew. 2. There is no limit to the number of loads that may be connected to the 


CLKBUF macro. 


OUTBUF, TRIBUFF & BIBUF Macros 








С, = 50 pF 
Parameter From - To CMOS TTL Units 
teu D to Pad 3.9 4.9 ns 
{рн D to Pad 7.2 5.7 ns 
tpuz E to Pad 5.2 3.4 ns 
їн E to Pad 6.5 4.9 ns 
taz E to Pad 6.9 5.2 ns 
ід Е ќо Рад 4.9 5.9 ns 





Change In Propagation Delay with Load Capacitance 





Parameter From - To CMOS TTL Units 
{рн D to Pad 0.03 0.046 ns/pF 
tPLH D to Pad 0.07 0.039 ns/pF 
{нг Е to Раа 0.08 0.046 ns/pF 
{рун E to Pad 0.07 0.039 ns/pF 
teiz E to Pad 0.07 0.039 ns/pF 
tez E to Pad 0.03 0.039 ns/pF 
Notes: 
1. The BIBUF macroinput section exhibits the same delays as the INBUF 2. Load capacitance delay delta can be extrapolated down to 15 pF 
macro. minimum. 
Example: 


Delay for OUTBUF driving a 100-pF TTL load: 
tpt = 4.9 + (.046 x )100-50(( = 4.9 + 2.3 = 7.2 ns 
{рн = 5.7 + (.039 x (100-50)) = 5.7 + 2.0 = 7.7 ns 


- 
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Timing Derating specifications. The derating factors shown in the table below are 

based on the recommended operating conditions for ACT 1 
Operating temperature, operating voltage and device processing commercial, industrial, and military applications. The derating 
conditions, along with device die size and speed grade, account for curves show worst-to-best case operating voltage range and 
variations in array timing characteristics. These variations are best-to-worst case operating temperature range. 


summarized into a derating factor for ACT 1 array typical timing 


Timing Derating Factor (x typical) 





Commercial Industrial Military 
Device BestCase Worst Case Beet Case Worst Case BestCase Worst Case 
A1010, A1020 0.45 1.54 0.40 1.65 0.37 1.79 
A1010A, A1020A 
Standard Speed 0.45 1.54 0.40 1.65 0.37 1.79 
-1 Speed Grade 0.45 1.28 0.40 1.37 0.37 1.49 
-2 Speed Grade 0.45 1.13 0.40 1.20 0.37 1.32 
Note: 
Best case reflects maximum operating voltage, minimum operating tem- voltage, maximum operating temperature, and worst case processing. Best 
perature, and best case processing. Worst case reflects minimum operating case derating is based on sample data only and is not guaranteed. 
Voltage Derating Curve Temperature Derating Curve 





45 475 50 625 55 "80 -40 -20 O 20 40 60 ВО 100 120 
Voc (Volts) Temperature (°C) 





Output Buffer Delays 


To AC test loads (shown below) 





ын н. 
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AC Test Loads 
Load 1 Load 2 
(Used to measure propagation delay) (Used to measure rising/falling edges) 
Vcc GND 
To the output under test e e 
50 pF R to Voc for teiz/tez. 
| R to GND for tenz/tezH 
== סד‎ the output under test R=1kQ 
d 
Input Buffer Delays Module Delays 
9 Fan-out = 2 
Fan-out = 2 u M 
B 
Y 
PAD 
Voc 
S,AorB 





tex 






50% 50% 
Vec 
50% 
50% 
IL 


tei 


н 





D-Type Flip-Flop and Clock Delays 


p PRE g 
Fan-out = 2 
P cin 


(Positive edge triggered) 





tsu = Setup time 
ta = Hold time (0 ns) 
tw = Minimum pulse width (10 ns) 
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Soft Macro Library Overvlew 























Macro Name Modules Req'd Description Levels of Logic 
Counters 

CNT4A 17 4 bit loadable binary counter with clear 

CNT4B 15 4 bit loadable bin counter w/ clr, active low carry In & carry out 

UDCNT4A 24 4 bit up/down cntr w/ sync active low load, carry in & carry out 

Decoders 

DEC2X4 4 2 to 4 decoder 1 
DEC2X4A 4 2 to 4 decoder with active low outputs 1 
DEC3X8 8 3 to 8 decoder 1 
DEC3X8A 8 3 to 8 decoder with active low outputs 

DEC4X16A 20 4 to 16 decoder with active low outputs 2 
DECE2X4 4 2 to 4 decoder with enable 

DECE2X4A 4 2 to 4 decoder with enable and active low outputs 1 
DECE3X8 11 3 to 8 decoder with enable 2 
DECE3X8A 11 3 to B decoder with enable and active low outputs 2 





Latches and Registers 



































DLC8A 8 Octal latch with clear 1 
DLE8 8 Octal latch with enable 1 
DLM8 8 Octal latch with mulitplexed inputs 1 
REGE8A 20 Octal register with preset and clear, active high enable 2 
REGE8B 20 Octal register w/ active low clock, preset & clear, active high en. 2 
Adders 

FA1 3 One bit full adder 3 
FADD8 37 8 bit fast adder 4 
FADD12 62 12 bit fast adder 5 
FADD16 78 16 bit fast adder 5 
FADD24 120 24 bit fast adder 6 
FADD32 160 32 bit fast adder 7 
Comparators 

ICMP4 5 4 bit identity comparator 2 
ICMP8 9 8 bit identity comparator 3 
MCMP16 93 16 bit magnitude comparator 5 
MCMPC2 9 2 bit magnitude comparator with enables 3 
MCMPC4 18 4 bit magnitude comparator with enables 4 
MCMPC8 36 8 bit magnitude comparator with enables 6 
Multiplexors 

MX8 3 8 to 1 multiplexor 2 
MX8A 3 8 to 1 multiplexor with an active low output 

MX16 5 16 to 1 multiplexor 

Multipilers 

SMULT8 235 8 x 8 two's complement multiplier Varies 











Soft Macro Library Overview (continued) 























Macro Name Modules Req'd Description Levels of Logic 
Shift Registers 

SREG4A 8 4 bit shift register with clear 

SREG8A 18 B bit shift register with clear 2 
TTL Replacements 

TA138 12 3 to 8 decoder with 3 enables and active low outputs 2 
TA139 4 2 to 4 decoder with an enable and active low outputs 1 
TA151 5 8 to 1 multiplexor with enable, true, and complementary outputs 3 
TA153 2 4 to 1 multiplexor with active low enable 2 
TA157 1 210 1 multiplexor with enable 1 
TA161 22 4 bit sync counter w/ load, clear, count enables & ripple carry out 3 
TA164 18 8 bit serial in, parallel out shift register 1 
TA169 25 4 bit synchronous up / down counter 6 
TA181 81 4 bit ALU 4 
TA194 14 4 bit shift register 1 
TA195 10 4 bit shift register 1 
TA269 50 8 bit up/down cntr w/ clear, load, ripple carry output & enables 8 
TA273 18 Octal register with clear 1 
TA280 9 Parity generator and checker 4 
TA377 16 Octal register with active low enable 1 
Super Macros 

UART 189 Universal Asychronous Receiver / Transmitter 7-Tx 4-Rx 
MC 102 DRAM Controller Varies 
DMA 225 Direct Memory Access Controller Varies 
SINT 180 SCSI Interface Controller Varies 
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Hard Macro Library Overview 
The following illustrations show all the available Hard Macros. 


2-Input Gates (Module Count = 1) 








3-Input Gates (Module Count = 1, unless indicated otherwise) 


(2) Indicates 2-module macro 
Á. Indicates extra delay input 
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4-Input Gates (Module Count = 1, unless Indicated otherwise) 
(©) Indicates 2-module macro 


Á. Indicates extra delay input 








XOR Gates XOR OR Gates XOR AND Gates 
(Module Count = 1) (Module Count = 1) (Module Count = 1) 











AND XOR Gates 
(Module Count — 1) 


Ope Op Re 
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AND OR Gates (Module Count = 1) 
(2) Indicates 2-module macro 
Á. Indicates extra delay input 











us 
y 





ACT 1 FPGAs 





OR AND Gates (Module Count = 1) 
@ Indicates 2-module macro 
A Indicates extra delay Input 


< 


< 


оо <|ש|‎ 
- І . 
< 








Buffers (Module Count = 1) 











A Y A Y A 0 
МО Buffers (I/O Module Count = 1) 
BIBUF PAD 


E 
PAD p" PAD 


Multiplexors (Module Count = 1) 
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Latches (Module Count = 1) 


D Q D QN O 
DL1 DLIA 
G G 


D Latches with Clear (Module Count = 1) 











D Latches with Enable (Module Count = 1) 


D Q D Q D Q 
E DLE Q) E DLEA E DLEB С] 


Мих Latches (Module Count = 1) 
DLME1A 


A A A 
B e B a B Q 
DLM DLMA s 
s s (ЛЕ 
G Je Je 
Adders (Module Count = 2) 
A QA A A 
B B B B 
co co co D сор 
s s sp S 
HAI НАТА НА1В HAIC 


Macros FA1A, FA1B, and FA2A have two level delays from the inputs to the S outputs, as indicated by the À. 
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D Flip-Flops (Module Count = 2) 


D QNID D Q e, 
DF1A DF1B 
[2 > > Cp 





D Flip-Flops with Clear 


D Q 
DFC1 
x 
CLR 





D Flip-Flops with Preset 


PRE 
D Q 
DFP1 
> 


D Flip-Flops with Preset and Clear 
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D Flip-Flops with Enable (Module Count = 2) 


D Q D Q 
E DFEA Q E DFE1B 
> CP > 








JK Flip-Flops (Module Count = 2) 





Mux Filp-Flops (Module Count = 2) 


DFME1A 


A A 
B 9 B Q 
DFM s 
s Oe 
> > CLK 








CLKBUF Interface Macros (Module Count = 1) 
A " 4 3 A " A т 


D0 


Y D1 
Y 
D2 


D3 
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Package Pin Assignments 









(Top View) 
a 
z 
$ ë 
D D 
3 
з D PRB or /O 
зв D PRA or | 5e P PRB or VO 
зт E! DCLK or I/O PRA or I/O 
GND se P SDI or 1/0 DCLK or 1/0 
35 BV, SDI or I/O 
34 כ‎ MODE Voc 
33 B CLK or 1/0 54 D MODE 
Voc 32 P GND 
31 CLK or I/O 
Vpp C] 30 
29 
GND 
E: 8 
$ > 
9 
а 5 
m 
$ ő E 
[] ו‎ 0 
4 
סא‎ та H PRA or I/O 
DCLK or I/O 
72 Ø SDI or 1/0 
GNDG V. 
GNDL] 5 
MODE 
CLK or VO 
V. в [2 GND 
Veo ₪ כ‎ GND 
О [] 
a 
£ 2 $ 
Notes: 
1. Vpp must be terminated to Vcc, except during device programming. 3. Unused I/O pins are designated as outputs by the Action Logic System 
2. MODE must be terminated to circuit ground, except during device and are driven low. 


programming or debugging. 4. All unassigned pins are available for use as I/Os. 
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Package Pin Assignments (continued) 











з D PRB or /O 
р PRA or I/O 5e D PRB or I/O 
DCLK or YO se D PRA or I/O 
SDI or I/O DCLK or I/O 
Voc SDI or /O 
MODE V 
CLK or I/O 54) MODE 
GND 
CLK or I/O 
GND 
Q 
3 5 
$ б Ё 
П 0 
4 
ned 74 B PRA or !/O 
OCLK or I/O 
SDI or /O 
GNDG Dy, 
GNDG $7 [3 Ve 
2 MODE 
P CLK or I/O 
M [3 GND 
Veo Б GND 
40 
0 
б 
+ 2 8 
Notes: 
1. Vpp must be terminated to Усс, except during device programming. 3. Unused 1/O pins are designated as outputs by the Action Logic System 
2. MODE must be terminated to circuit ground, except during device and are driven low. 
programming or debugging. 4. All unassigned pins are available for use as I/Os. 
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Package Pin Assignments (continued) 


Pin #1 
Index 





NC 1 63 PRA or I/O 
2 62 DCLK or I/O 
3 81 SDI or I/O 
4 60 
5 58 
в 58 
GND 7 57 Vop 
GND 5 5° Мор 
9 55 MODE 
10 54 
1 53 CLK or I/O 
12 52 
13 51 
14 50 GND 
M 15 49 GND 
18 48 
17 4 
18 48 
19 45 
20 44 
21 43 
Notes: 
1. Vpp must be terminated to Vcc, except during device programming. 3. Unused I/O pins are designated as outputs by the Action Logic System 
2. MODE must be terminated to circuit ground, except during device and are driven low. 
programming or debugging. 4. All unassigned pins are available for use as I/Os. 
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Package Pin Assignments (continued) 


OOOOOOOOOOQOI- 
0 0 0 ס ס0 ס ס ס ס‎ 0 Olin 


A 
B 
c 
D 
E 
F 
G 
H 
J 
K 
L 





@ Orientation Pin (C3) 











Signal A1010-Serles Devices A1020-Series Devices 
PRA A11 A11 
PRB B10 B10 
MODE E11 E11 
SDI B11 B11 
DCLK C10 C10 
Vpp K2 K2 
CLK or I/O F9 F9 
GND B7, E2, E3, K5, F10, G10 B7, E2, ЕЗ, K5, F10, G10 
Vcc B5, F1, G2, K7, E9, E10 B5, F1, G2, K7, E9, E10 
N/C (No Connection) B1, B2, C1, C2, K1, J2, L1, J10, K10, K11, C11, D10, D11 B2 
Notes: 
1. Vpp must be terminated to Усс, except during device programming. 3. Unused I/O pins are designated as outputs by the Action Logic System 
2. MODE must be terminated to circuit ground, except during device and are driven low. 
programming or debugging. 4. All unassigned pins are available for use as I/Os. 
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Package Mechanical Details 
J-Leaded Cerquad Chip Carrier 










.023" min. Pin 1 Index Chamfter 


E 


mi 


ד 






.050” Se hd 
ד‎ 


.028" + mE 
= 


019” + .003” 


магі 
=F 


105" + .010” pio 


.155" 
180" 


















Lead Count D,E D1, E1 
.690" + .005" .650" + .008" 
990" + .005” .950" + .008" 
1.190" + .005" 1.150" + .008” 


Plastic J-Leaded Chip Carrler 








020” min. 7 


175" + 010" 


Lead Count DE D1, E1 
44 .690" + .005" .655" + .005" 


68 .990” + .005” .955" x .005” 


84 1.190” + .005” 1.155” + .005” 
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Package Mechanical Details (continued) 
Ceramic Quad Flatpack 


0.058” + .010” 


0.085” + .010” 


010" > .002” T 


_ _ "005 + "025. 
ד 








-- .050” + .010” 
0.350" + .025" 
0.006" + .002" 








D,E D1, E1 
84 1.350” + .030” 0.650” + .010” 


Lead Count 


en 








Package Mechanical Details (continued) 
Ceramic Pin Grid Array 


.050" + .010” 
(4 pics) 








оо 
оо 
оо 
OO 
оо 
O O 
оо 
оо 
оо 
оо 


e 0 

OOO 
оооооооое 
@ Orientation Pin 





Q 
о 





ACT" 2 
Field Programmable 


Achte] 





Gate Arrays 


Preliminary 








Features 

Up to 8000 Gate Array Gates 

(20,000 PLD/LCATM equivalent gates) 

Up to 1232 Logic Modules 

Up to 140 User-Programmable 1/О Pins 

Package Sizes to 176-Pin CPGA 

Instant Prototypes and Production 

Nonvolatile, Permanent Programming 

Flip-Flop Toggle Rates to 120 MHz 

System-Level Performance to 60 MHz (A1240 & A1225) 
Supports Wide-Input Combinatorial Functions 

Supports Single-Module Sequential Functions 

Two In-Circuit Diagnostic Probe Pins Support 50-MHz 
Analysis 

e Low-Power CMOS Technology: Standby current < 300 „A 
* Two High-Speed, Low-Skew Clock Networks 

e I/O Drive to 8 mA 


Product Family Profile 











Device A1280 A1240 A1225 
Capacity 
Gate array equivalent gates 8000 4000 2500 
PLD/LCA equivalent gates 20,000 10,000 6250 
TTL equivalent packages 210 105 70 
CMOS Process 1.2 um 1.2 um 1.2 um 
Logic Modules 1232 684 451 
Flip-Flops (maximum) 998 565 341 
PLICETM Antifuses 750,000 400000 250,000 
User I/Os 140 104 82 
Packages 176 CPGA 132 CPGA 100 CPGA 
Description 


The ACT™ 2 family is the next generation of field programmable 
gate arrays from Actel. Based on Actel’s patented channeled array 
architecture, the ACT 2 family provides significant enhancements 
to gate density and performance while maintaining upward 
compatibility from ACT 1 designs. The devices are implemented in 
silicon gate, 1.2-um, two-level metal CMOS, and they employ 
Actel’s PLICE antifuse technology. The unique architecture offers 
gate array flexibility, high performance and instant turnaround 
through user programming. Designs of up to 8000 gates can be 
implemented with the A1280 device. The ACT 2 family is 


supported by the Action Logic™ System (ALS). ALS is available 
onSun™, Apollo™, and 386 PC platforms, with CAE interfaces 
to Valid™, Viewlogic®, Mentor Graphics®, and OrCADTM. 


Enhanced Logic Module 


The logic module used in the ACT 2 family is an enhanced version 
of the ACT 1 module, allowing improved implementation of high 
fan-in combinatorial macros. The ACT 2 logic modules are also 
optimized to implement high-speed flip-flops and latches. With 
these enhancements, up to 100% performance improvement over 
ACT 1 devices can be realized. 


Figure 1 depicts part of a state machine implemented with five 
ACT 2 modules. The critical path delay is only two module delays, 
compared to a critical path delay of four module delays for a similar 
design implemented with ACT 1 devices. 








Figure 1. Part of State Machine Implemented In 


Five ACT 2 Modules, 





Enhanced Architecture 


Routing efficiency with large gate counts is accomplished with 
increased routing resources, increased antifuse programming 
elements, and architectural enhancements. Horizontal routing 
tracks/channel increase to 36 (vs 25 for ACT 1); vertical routing 
tracks/column increase to 15 (vs 13 for ACT 1). All speed-critical 
module-to-module connections are accomplished with only two 
low-resistance antifuse elements. Most connections are 
implemented with either two or three antifuse elements (as shown 
in Figure 2). No connections are allowed with more than four 
antifuse elements in the path. The result is predictable 
performance with fully automatic placement and routing. Device 
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utilization is typically 85% to 95% of available logic modules and 


80% of gates. 
B 









uncommitted vertical segment 





Figure 2. ACT 2 Enhanced Routing Architecture 


Programmable I/O Pins 


Each ЏО pin is available as an input, output, three-state, or 
bidirectional buffer. Inputs are TTL and CMOS-compatible. 
Output drive levels meet 8-mA TTL and 6-mA HCT standards. 


User-programmable slew rate is provided to limit simultaneous 
switching noise. Optional transparent latches at the I/O pins are 
provided for both inputs and outputs. I/O latches can be combined 
automatically with latches in the array to implement master-slave 
flip-flops, as depicted in Figure 3. 


Low-Skew Network 


Two low-skew distribution networks are provided. Each network 
can be driven by either of two dedicated I/O pins or from internal 
logic. 


Enhanced Programming and Test 


The ACT 2 family provides the same guaranteed functionality as 
ACT 1 provides. All routing tracks, logic modules, and program, 
debug, and test circuits are fully tested in the factory. Verification of 
correct antifuse programming is performed automatically with 
Actel’s Activator™ 2 programming and debug hardware. The 
ACT 2 architecture implements an enhanced programming and 
test algorithm. Programming time for designs up to 8000 gates is 
comparable to programming time for 2000-gate designs 
implemented with an ACT 1 array. 


Probe Pins 


ACT 2 family devices have two independent diagnostic probe pins, 
PRA and PRB. These pins allow the user to observe any two 
internal signals by entering the appropriate net name in the 
diagnostic software. Signals may be viewed on a logic analyzer 
using Actel’s Actionprobe™ diagnostic tools. The probe pins can 
also be used as user-defined I/Os when debugging is finished. 





LOGIC MODULES 





Figure 3. Latched User I/Os 





Ordering Information 


A1280 - -1 PG 176 С 
| L Application 
Package Lead Count 
Package Type 
Speed Grade 
ig Pie 
Part Number 


Absolute Maximum Ratings 
Free air temperature range 


Symbol Parameter Limits Units 


Vcc DOC Supply Votltage!:2.3 
V Input Voltage 
Vo Output Voltage 


-0.5 to 0 Volts 
-0.5 to Мос +05 Volts 
-0.5 to Vcc +0.5 Volts 


li Input Clamp Current +20 mA 
lox Output Clamp Current +20 mA 
lok Continuous Output Current +25 mA 


Tstg Storage Temperature -65 to + 150 °C 


Stresses beyond those listed under “Absolute Maximum Ratings” may 
cause permanent damage to the device. Exposure to absolute maximum 
rated conditions for extended periods may affect device reliability. Device 
should not be operated outside the Recommended Operating Conditions. 
Notes: 

1. Vpp = Vcc, except during device programming. 

2. Vsv = Vcc, except during device programming. 

3. Vxs = GND, except during device programming. 


Recommended Operating Conditions 





Parameter Commercial Industrial Military Units 
Temperature x ЕЧ 
Range (Note 1) Oto +70 4010 +85 -55t0 +125 °C 
Power Supply 
Tolerance +5 +10 +10 %Vcc 
Note: 


1. Ambient temperature (TA) used for commercial and industrial; case 
temperature (Tc) used for military. 


ACT 2 FPGAs 











Packages 
Part 
Number Package Type Lead Count 
A1280 PG (Ceramic Pin Grid Array) 176 
A1240* PG (Ceramic Pin Grid Array) 132 
A1225* PG (Ceramic Pin Grid Array) 100 
Applications 
0 Commercial 
₪ Industrial 
M* Military 
B* 883 B 
Speed Grades 
Part 
Number Speed Grade 
A1280 Standard (no mark) 
-1* 
* Important: 


Contact your Actel representative for current availability. 


Power Dissipation 

Total device power = Input buffer power + Output buffer power 
+ Array power + Clock network power. 

A1280 Device Power Calculation 


The following equations can be used to calculate power dissipa- 
tion for an A1280 device design: 


1. Input buffer power: N x (5.5 + 0.008 x Vec? x f) x 10° W 

2. Output buffer power: Сү x Vcc? x f x N x 0.55 x 10% W 

3. Array power: N x Vcc? x f x 65 א‎ 10° W 

4. Clock network power: (0.8 א‎ M + 160)x Vcc2x fx 1.18x 10% W 
Where: 


N = Number of modules or I/O buffers switching at frequency f 
f = Clock frequency (MHz) 

C, = Capacitance (pF) 

M =Number of clock loads 


For example, by substituting sample values for the variables in 
equations 1 through 4: 


Input buffers (equation 1): 
N = 32; Voc = 5 V; f = 30 MHz 


Output buffers (equation 2): 

Ck = 50 pF; Усс = 5 V; f = 30 MHz; N = 32 
Array (equation 3): 

N = 120; Vcc = 5 V; f = 30 MHz 

Clock network (equation 4): 

М = 300; Vcc = 5 V; f = MHz 


Total device power for this example = .35 W + .66 W + .60 W + 
35 W = 196 W 
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Electrical Specifications 














Parameter 














(он = -6 mA) 3.84 у 
von! (он = -4 mA) 37 37 у 
Vou! (lo, = 6 mA) 0.33 0.40 0.40 у 
VL -0.3 0.8 -0.3 0.8 -0.3 0.8 у 
Ун 20 Vcc + 0.3 20 Vcc + 0.3 20 Усс + 0.3 у 
Input Transition Time ta, {2 50 50 50 ns 
Cio I/O Capacitance? 3 10 10 10 pF 
Standby Current, Ioc4 350 TBD TBD pA 
Leakage Current® -10 10 -10 10 -10 10 pA 
los Output Short _ (Vo = Voc) 20 140 20 140 20 140 mA 
Circuit Current? | (v, = GND) -10 -100 -10 -100 -10 -100 mA 

Notes: 


. Only one output tested at a time. Vcc = min. 

. Not tested, for information only. 

. Includes worst-case 176 CPGA package capacitance. Vour = 0 V, f = 1 MHz. 
. All outputs unloaded. All inputs = Vcc or GND. 

. Vo, Уш = Усс ог GND. 

. Only one output tested at a time. Min. at Усс = 4.5 V; Max at Vcc = 5.5 V. 


оол + о кюю 


Package Thermal Characteristics 


The device junction to case thermal characteristic is Өјс, and the Maximum junction temperature is 150°C. 
junction to ambient air characteristic is Өја. The thermal A sample calculation of the maximum power dissipation for a 
characteristics for @ja are shown with two different air flow rates. 176-pin ceramic pin grid array at military temperature is as follows: 


Max. junction temp. (°C) - Max. military temp. (^C) — 150°C - 125°C 








= =12W 
Өја (°C/W) 20°C/W 
Package Type Pin Count Өје аи 200 min. Units 
Ceramic Pin Grid Array (PG) 176 2 20 10 °C/W 
132 TBD TBD TBD *CN 
100 TBD TBD TBD *CN 


ACT 2 FPGAs 





Timing Characteristics and critical (speed-sensitive) nets are given below. Most nets will 
fall into the “typical” category. 

Timing is design-dependent; actual delay values are determined 

after place and route of the design using the ALS Timerutility. The 

following delay values use statistical estimates for wiring delays 

based on 85% to 90% module utilization. Device utilization above 

95% will result in performance degradation. 


Less than 1% of all routing in a design requires the use of “long 
tracks.” Long tracks, long vertical or horizontal routing paths, are 
used by the autorouter only as needed. Delays due to the use oflong 
tracks range from 15 ns to 30 ns. Long tracks may be used to route 
the least-critical nets in a given design. 

With ALS place and route programs, the user can assign criticality 

level to а net, based on timing requirements. Delays for both typical 


A1280 Logic Module Timing 
Voc = 5.0 V; Ta = 25°C; Process = Typical; tgp = 3.0 ns @ FO = 0 





Single Logic Module Macros 
(e.g., most gates, latches, multiplexors) 








Parameter Output Net РО = 1 РО = 2 ҒО = 3 FO = РО = 8 Units 
tep Critical 5.5 5.8 6.2 Note 1 Note 1 ns 
to Typical 6.2 6.5 7.4 8.3 10.2 ns 





Dual Logic Module Macros 
(e.g., adders, wide Input gates) 








Parameter Output Net РО = 1 РО = 2 РО = 3 РО = 4 РО = 8 Units 
tep Critical 8.5 8.8 9.2 Note 1 Note 1 ns 
tep Typical 92 65 74 8.3 10.2 ns 





Sequential Element Timing Characteristics 





Parameter 
tsu Set Up Time, Data Latches 1.0 1.0 1.0 1.0 1.0 ns 
tsu Set Up Time, Flip-Flops 1.0 10 1.0 1.0 10 ns 
ty Hold Time 0 0 0 0 0 ns 
tw Pulse Width, Minimum2 10.0 10.0 10.0 10.0 10.0 ns 
tco Delay, Critical Net 5.5 5.8 6.2 Note 1 Note 1 ns 
tco Delay, Typical Net 6.2 65 74 8.3 10.2 ns 





Notes: 
1. Critical nets have a maximum fan-out of 6 and a recommended fan-out > 
2. Minimum pulse width, tw, applies to CLK, PRE, and CLR inputs. 
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A1280 I/O Buffer Timing 
Vcc = 5.0 V; Ta = 25°C; Process = Typical 
INBUF Macros 
Parameter From - To FO = 1 FO=2 FO = 3 РО = 4 РО = 8 Units 
{ро Pad to Y 6.2 65 74 8.3 10.2 ns 
Notes: 


1. tpp = ten + tPLH 
2 


CLKBUF (High Fan-Out Clock Buffer) Macros 























Parameter FO = 140 FO = 280 FO = 700 Unite 
teu 97 10.9 14.5 ns 
їн. 97 11.0 14.8 ns 

Notes: 

1. A clock balancing feature is provided to minimize clock skew. 2. There is no limit to the number of loads that may be connected to the 

CLKBUF macro. 
OUTBUF, TRIBUFF & BIBUF Macros 
C = 50 pF 
Parameter From - To CMOS TTL Units 

tepi D to Pad 8.9 7.1 ns 
tep2 D to Pad 14.9 12.1 ns 
toz E to Pad 6.8 6.8 ns 
tzo E to Pad 9.8 9.8 ns 
tco G to Pad 10.8 10.8 ns 

Notes: 

1. The BIBUF macro input section exhibits the same delays as the INBUF 3. 1р2 = teuz + trez 

macro. 2 


2. tpp is slew-rate dependent: 
tppı = high-slew output. 
tpp2 = low-slew output. 


4. tzp = fpzH + 1. 
2 
5. tco is latched data out. 


Timing Derating summarized into a derating factor for ACT 2 array typical timing 

specifications. The derating factors shown in the table below are 
Operating temperature, operating voltage and device processing based on the recommended operating conditions for ACT 2 
conditions, along with device die size and speed grade, account for applications. 


variations in array timing characteristics. These variations are 


Timing Derating Factor (x typical) 








Commercial 
Device Best Case Worst Case 
A1280 
Standard Speed 0.40 1.40 
-1 Speed Grade 0.40 1.20 
Note: 


Best case reflects maximum operating voltage, minimum operating tem- 
perature, and best case processing. Worst case reflects minimum operating 
voltage, maximum operating temperature, and worst case processing. Best 
case derating is based on sample data only and is not guaranteed. 


- 
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Output Buffer Delays 


[ӘЙ To AC test loads (shown below) 

















tpzH toyz 

AC Test Loads 

Load 1 Load 2 
(Used to measure propagation delay) (Used to measure rising/falling edges) 
* GND 
To the output under test e 
/ 
50 pF R to Мос for teztez. 
| R to GND for terz/tezH 
== To the output under test R = 1kQ 
a 

Input Buffer Delays Module Delays 

s Fan-out = 2 
Fan-out = 2 A Y 











yera 








D-Type Flip-Flop and Clock Delays 
PRE 
D Q 
Fan-out = 2 
P CLR 


(Positive edge triggered) 


tsu = Setup time 
їн = Hold time (0 ns) 
tw = Minimum pulse width (10 ns) 





Hard Macro Library Overview 


The following illustrations show all the available Hard Macros in 
the ALS Macro Library. 


2-Input Gates (Module Count - 1) 
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3-Input Gates(Module Count = 1) 








4-Input Gates 
(Module Count = 1) 


ojojoj 





{Module Count = 2) 







AND4D 





- 
4 
0 








5-Input Gates (Module Count = 1) 


A C) A O O A е 

8-6 Y של---8‎ Y 0 Y an" Y 
8 AND5B E 0 O OR5B Б ₪ NANDSC [© 

E E E 


XOR Gates (Module Count = 1) XOR OR Gates (Module Count = 1) 


MOON» 








XOR AND Gates (Module Count = 1) 


> |= 


AND XOR Gates 
(Module Count = 1) 





A 


Op: Әә» yp: 


(Module Count = 2) 


A 


א 


nn‏ .ו 
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AND OR Gates (Module Count = 1) 
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OR AND Gates (Module Count = 1) 


O 


A 
B 
> : 
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Buffers (Module Count = 1) 


s 


I/O Buffers (I/O Module Count = 1) 

















Y 
PAD 
CLKBIBUF| 
Low-Slew Outputs 

D 

PAD 

igh- 
E 

D D 

PAD PAD 








ICTS) 








Buffers with Latches‏ 0/ו 









OBDLHS 








1-42 





ACT 2 FPGAs 





Multiplexors 
(Module Count = 1) 





(Module Count = 2) 














D Latches with Clear (Module Count = 1) 
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D Latches with Enable (Module Count = 1) 


D о D Q D а 
C] E DLEA E DLEB Q E DLEC 
G qa Qc 





Mux Latches (Module Count = 1) 





DLME1A 
A A A 
B 9 B о B Q 
DLM DLMA s 
s s ОЕ 
а OG OG 
Adders (Module Count = 2) 
A ₪0 A A 
B B B B 
co co CO сор, 
5 s sO s 
HA1 НА1А НА1В НА1С 





Macros FA1A, FA1B, and FA2A have two level delays 1гот the inputs to the S outputs, as indicated by the À. 





D Flip-Flops (Module Count = 1) 


D QN D 
DF1A 
> > 


Р 
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D Flip-Flops with Clear 
(Module Count = 1) 







(Module Count = 2) 





D Flip-Flops with Preset 
(Module Count = 1) 





(Module Count = 2) 
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D Flip-Flops with Enable (Module Count = 1) 


D Q D Q 
E DFEA C] E DFE1B 
D Cp > 


JK Filp-Flops (Module Count = 1) 














Toggle Flip-Flops (Module Count = 1) 





Mux Flip-Flops (Module Count = 1) 








CLKBUF Interface Macros (Module Count = 1) 





b indicates clock input 
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Package Pin Assignments 





(Top View) 
1 2 3 4 2 5 
AO OOO О Ola 
BIOO 00 О O|B 
cO OOO O Olc 
pO OOO © О [р 
Е ОООО О Ole 
FIOO OO О O | = 
8/0 0 0 0 O Ola 
HO 000 iesu] O Olx 
ОООО O O| 
KO 0 0 0 0 O!k 
LO OOO О | 
MO OOO OOOOOOO O!M 
NO OOO ооооооо О | א‎ 
POO OO OOOOOOO O |P 
RO OOO 0 ס ספ‎ 00 0 OJR 
1 2 3 4 6 7 8 9 10 11 12 5 
Signal Pin No. Location 
PRA or I/O 152 cg 
PRB or 1/0 160 D7 
MODE 2 c3 
SDI or VO 135 B14 
500 or I/O 87 P13 
DCLK or VO 175 B3 
CLKA or I/O 154 AQ 
CLKB or I/O 158 B8 
GND 1, 8, 18, 23, 33, 38, 45, 57, 67, 77, 89, D4, E4, G4, H4, K4, L4, M4, M6, M8, M10, M12, 
101, 106, 111, 121, 126, 133, 145, 156, 165 K12, J12, H12, F12, E12, D12, D10, C8, D6 
Усс 13, 24, 28, 52, 68, 82, 112, 116, 140, 155, 170 F4, НЗ, J4, М5, М8, M11, H13, G12, D11, D8, 5 
Vpp 110 414 
Vsv 25, 113 H2, H14 
Vis 109 J13 
Notes: 
1. Unused 1/О pins are designated as outputs by the Action Logic System 4. Vpp = Vcc, except during device programming. 


and are driven low. 
2. All unassigned pins are available for use as I/Os. 
3. MODE = GND, except during device programming or debugging. 6. Vxs = GND, except during device programming. 


5. Vsv = Vcc, except during device programming. 
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Package Mechanical Details 
Ceramic Pin Grid Array 
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Pin 1 


1.570" + .015” square 
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BOTTOM VIEW 





0.105” + .008” 
0.122” + .010” 


018" + .002” 


E 
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| 0.050" 


_ 


100” typ. 


0.130" + .010" 


+ 005" 





Determining Gate Density and 


Preliminary 
Applications 


Performance of the A1280 Note 





Introduction 


The ACTTM 2 family, introduced in 1990, is the next generation of 
field programmable gate arrays from Actel Corp. Based on Actel's 
patented channeled array architecture, the ACT 2 family provides 
significant enhancements to gate density and performance while 
maintaining fast user-programmable design flow. The family is 
implemented in silicon gate 1.2 um two-level metal CMOS, and 
employs Actel's PLICETM antifuse technology. Design support is 
provided for Sun™, Apollo@, and 386 PC platforms running 
standard CAE systems provided by Viewlogic@, Mentor 
Graphics, Valid™, and OrCADTM. The ACT 2 device family is 
upwardly compatible with the ACT 1 family; designs implemented 
in ACT 1 can be migrated easily to the larger capacity arrays. The 
architecture was described by Green, et al (1). This paper will focus 
on the A1280, the 8000-gate member of the family. Device capacity 
and performance will be compared to the ACT 1 family. 


Table 1 compares characteristics of two FPGA devices, the A1020 
from the ACT 1 family and the A1280 from the ACT 2 family. 
Logic capacity has increased from 2000 to 8000 gates while 
maintaining a high utilization factor. This is accomplished by 
scaling the interconnect resource: routing tracks and antifuses. 
Horizontal routing tracks/channel increase to 36 (vs 25 for ACT 1); 
vertical routing tracks/column increase to 15 (vs 13 for ACT 1). All 
speed-critical connections between modules are accomplished with 
only two low-resistance antifuse connections. Most connections 
are implemented with three or fewer antifuse elements. No 
connections are allowed with more than four antifuse elements in 
the path. The result is predictable performance combined with 
fully automatic placement and routing. A four-fold increase in 
capacity over ACT 1 is achieved with a 2.25 to 1 increase in total 
modules. This was made possible by an enhancement to the logic 
module architecture used in the ACT 2 family. This new module 
structure will be explained and the impact on device capacity and 
performance will be examined. 





Table 1 А1020 and А1280 Comparison 
ACT 1 ACT2 
A1020 A1280 
Capacity 
Gate array equivalent gates 2,000 8,000 
PLD/LCA™ equivalent gates 5,000 20,000 
Gate utilization 90% 80% - 90% 
CMOS Process 2.0 um 1.2 um 
Logic Modules 547 1232 
Module Utilization 90% 90% - 100% 
Routing 
PLICE elements 186,000 750,000 
Horizontal tracks/channel 25 36 
Vertical tracks/channel 13 15 


Gate Utilization 


Gate capacity utilization for ACT 1 arrays has been reviewed by 
McCarty (2). The total gate capacity can be expressed in terms of 
the average gate utilization per module times the total number of 
utilized modules. 


ACT 1 Usable Gates = gates/module x number of modules x percent utilization 


ACT 2 Logic Module Capacity 


The ACT 2 family employs a two-module architecture. Logic 
modules capable of implementing combinatorial logic 
(C-Modules) and modules capable of implementing both 
combinatorial and sequential logic (S-Modules) are provided. 
Figure 1 depicts possible implementations of the S-Module 
including the C-Module equivalent function. The C-Module is 
capable of implementing all functions of two variables, most 
functions of three and four variables, plus several functions of five, 
six, and seven input variables. The S-Module combines a 
combinatorial module with a sequential element. The S-Module 
can function as combinatorial logic plus a D-type flip-flop, as 
combinatorial logic plus a transparent latch, or simply as 
combinatorial logic. 


The distribution of C-Modules and S-Modules is approximately 
even on all ACT 2 arrays. The gate capacity of an ACT 2 array can 
be expressed as the combination of C-Module gate capacity times 
C-Module utilization plus S-Module gate capacity times S-Module 
utilization. This is summarized as: 


ACT 2 Usable Gates = [(gates/S-Module x number of S-Modules) + 
(gates/C-Module x number of C-Modules)] x percent utilization 


The gate capacity of a sequential module is equal to the gate count 
of the D-type flip-flop plus the number of combinatorial gates 
implemented in the S-Module. The average combinatorial logic 
that can be used by the sequential module is dependant on the type 
of logic circuit being implemented. Benchmark circuits have been 
used to quantify gate capacity in FPGA architectures, Osann (3). 
The benchmarks contain four basic types of logic functions: 
datapath, timer/counter, state machine, and arithmetic. The 
benchmark circuits chosen typify many applications. The capacity 
to implement a design can be estimated by counting the number of 
benchmark units that can be configured into a device. The 
benchmark capacity of the A1020 and the A1280 are compared in 
Table 2. The results indicate a 4X increase in gate capacity result 
from a 2.25X increase in logic modules and a 1.8X increase in 
average gates per module. 
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Up to 7-Input function plus D-flip-flop with clear. 


Up to 4-input function plus latch with clear. 


Up to 7-input function plus latch. 





Up to 8-input function (same as C-Module). 


Figure 1 S-Module Implementations 








Table2 Comparison of Benchmark Capacity 
Total Circuits Total Gates Gates/Module 
Benchmark А1020 А1280 А1020 А1280 ACT1 ACT2 
Datapath 12 54 1884 8478 35 6.88 
Arithmetic 6 18 1770 5310 32 4,31 
State Machine 9 4 1377 6273 25 496 
Timer/Counter 6 26 1488 8 27 5.23 
Performance 


System performance is determined by several factors: latching data 
on-chip, moving data through-chip and latching data off-chip. 
On-chip/off-chip performance in the ACT 2 family is facilitated by 
incorporating transparent latches in the input and output buffers. 
The user may select the latch to operate in either latched or 
transparent mode. The output latch is transparent high while the 
input latch is transparent low. This facilitates construction of 
synchronous pipelines between input pads and output pads. In 
addition two high-drive low-skew clock networks are provided. 
Each clock can be driven externally or from internal logic. 
Through-chip performance is the product of the average delay per 
logic level and the number of logic levels. 





Logic Compression 


In both ACT 1 and ACT 2 families, logic levels are replaced by 
module levels since multiple levels of logic can be implemented in a 
single logic module. A six-level logic path in a circuit implemented 
in a gate array may reduce to a four-level path when implemented 
in ACT 1. This reduction in delay levels is termed compression. 
Logic compression in the ACT 2 family is enhanced by the 
increased gate capacity of the C-Module and the flexibility of the 
S-Module. A comparison of logic compression for the А1280 vs the 
A1020 is shown in Table 3. 


The most significant performance gains are observed in the 
datapath benchmark while the arithmetic circuit produced no 
overall gain. The use of critical net assignment also can improve 
specific logic paths by 30%. 











Table 3 Compression Ratio A1280 vs. А1020 
Benchmark Compression (A1280 : A1020) 
Datapath 20:1 
Arithmetic 12:1 
State Machine 18:1 
Timer/Counter 13:1 
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Net Delay 


The delay distribution for typical module plus net delay for the 
A1280 is shown in Figure 2. The shape of the distribution is 
virtually identical to distribution curves for ACT 1 devices. The 
A1280 curve is shifted slightly to the right due to the 20% larger 
delay through the more complex ACT 2 module. Also, the tail of 
the A1280 curve is significantly reduced. This is due to 
three-antifuse connections for long vertical routing tracks in the 
ACT 2 family vs four-antifuse connections for long vertical tracks 
in the ACT 1 family. As a result, less than 1% of the nets have 
delays > 15 ns and all nets are < 25 ns. 





700 


600 


500 


400 Total Nets = 4000 


Nets 


300 


200 


100 


Delay (ns) 


Figure 2 А1280 Delay Distribution 


Conclusion 


Logic capacity and performance for the A1280 was compared to 
the A1020. A four-fold increase in gate density is provided by the 
A1280 as indicated by benchmark results from four different logic 
types. An average performance increase of 25% for the A1280 vs 
the A1020 is achieved through logic compression provided by the 
ACT 2 module architecture. Net delays for the A1280 maintain a 
tightly centered distribution similar to the ACT 1 family in spite of 
the increase in device size. In addition, the long routing tracks of 
the A1280 contribute less delay than those of the A1020 due to 
architectural improvements. 


References 
1. Greene, J., et al, 1990, Custom Integrated Circuits 
Conference Proceedings. 


2. McCarty, D., 1990, Electronic Engineering Times 
Programmable Logic Design Guide, 14. 


3. Osann, R., et al, 1988, Electron Design, Oct. 13, 93. 


1-51 











1-52 


ACT™ 1 Military 


Field Programmable 
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Features 


* Military Data Sheet Devices 


— Operation Over Full Temperature Range; 
-55 °C to + 125 °C, with +10% Voltage Supplies 


Military STD 883-C Class B Devices 

— Compliance to MIL-STD-883 paragraph 1.2.1 

— Assembled to Method 5004 

— Dynamic Burn-In 

— Qualification to Method 5005 

— SMD 5962-90964 (A1010) and SMD 5962-90965 (A1020) 
— Group A, B, C, D Test Reports Available 

e Packaging 

— B4-Pin Ceramic Pin Grid Array 

— 44-, 68- and 84-Pin J-Leaded Cerquad Chip Carrier 
— 84-Pin Ceramic Quad Flatpack 
Architecture/Technology 


— Antifuse-based architecture offers design flexibility and 
high gate utilization 

— Architecture and antifuse provide high security for 
customer designs 


— CMOS process offers low power consumption 
— Complete Test Coverage 
* Design Tools 


— Actel’s Action Logic" M System uses fully automatic place 
and route algorithms and offers timing analysis to military 
worst-case specifications 


— Interface to Viewlogic®, Mentor Graphics®, Valid™, and 
OrCAD™ 

High Reliability 

— Nonvolatile, Permanent Programming 

— Antifuse reliability (< 10 FITS) better than that of the 
base CMOS process 

— Very low product failure rate: 91 FITS — 0.091% failures 
per 1000 hours at 70°C ambient temperature (90°C 
junction temperature) 


— Radiation tolerant CMOS antifuse process: preliminary 
total dose tests better than 1.5 megarads (Si) 


Preliminary 


Benefits 


Actel’s ACT™ 1 field programmable gate array (FPGA) is an ideal 
device for military applications. Designs are easily implemented 
and programmed in minutes. This nonvolatile device offers a 
risk-free solution for almost any type of logic integration problem. 
The ACT 1 family includes the A1010 device with a capacity of 
1,200 gate array equivalent gates or 3,000 PLD/LCA™ gates, and 
the A1020 device with a capacity of 2,000 gate array gates or 6,000 
PLD/LCA gates. 


No cost risk — Once you have an Action Logic System, Actel's 
CAE software and programming package, you can produce as 
many chips as you like for just the cost of the device itself, no NRE 
charges to eat up your development budget each time you want to 
try out a new design. 


No time risk — After entering your design, placement and routing 
is automatic, and programming the device takes only about 5 
minutes for an average design. You save time in the design entry 
process by using tools that are familiar to you. The Action Logic 
System (ALS) software interfaces to popular CAE software such as 
Mentor Graphics, Valid, OrCAD, and Viewlogic and runs on 
popular platforms such as Apollo®, Sun™, and 386™ PC 
compatible machines. 


No reliability risk — The PLICETM antifuse is a one-time 
programmable, nonvolatile connection. Since Actel devices are 
permanently programmed, no down-loading from EPROM or 
SRAM storage is required. Inadvertent erasure is impossible and 
there is no need to reload the program after power disruptions. 
Both the PLICE antifuse and the base process are radiation 
tolerant. Fabrication using a low-power CMOS process means 
cooler junction temperatures. Actel’s non-PLD architecture 
delivers lower dynamic operating current. Our reliability tests show 
a very low failure rate of 91 FITS at 90?C junction temperature 
with no degradation in AC performance. Special stress testing at 
wafer test eliminates infant mortalities prior to packaging. 


No security risk — Reverse enginecring of programmed Actel 
devices from optical or electrical data is extremely difficult. 
Programmed antifuses cannot be identified from a photograph or 
by using a SEM. The antifuse map cannot be deciphered either 
electrically or by microprobing. Each device has a silicon signature 
that identifies its origins, down to the wafer lot and fabrication 
facility. 


No testing risk — Unprogrammed Actel parts are fully tested at 
the factory. This includes the logic modules, interconnect tracks, 
and I/Os. AC performance is assured by special speed path tests 
and programming circuitry is verified on test antifuses. During the 
programming process an algorithm is run to assure that all 
antifuses are correctly programmed. In addition, Actel's 
Actionprobe™ diagnostic tools allow 100 percent observability of 
all internal nodes to check and debug your design. 
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Ordering Information 


M 
ü L Application 
Package Lead Count 
Package Type 
Spo, or 


dien на 


А1020 

















Рап Number 
Part Numbers 
Part Description 
A1010 1200 Gates* 
A1020 2000 Gates 
* Gate array gates 
Packages 
Type Lead Count 
PG Ceramic Pin Grid Array 84 
JQ J-Leaded Cerquad Chip Carrier 44, 68, 84 
cQ Ceramic Quad Flatpack 84 
Applications 
M Military 
B 883 B 
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Actel Military Product Flow 














Military 
883C —Class B 883C— Class B 
Step סוסה‎ 883C Method Requirement ee 
1.0 Internal Visual 2010, Test Condition B 100% 100% 
2.0 Ternperature Cycling 1010, Test Condition C 100% 100% 
3.0 Constant Acceleration 2001, Test Condition E 100% 100% 
(min), Y1, Orlentation only 
40 Seal 1014 
a. Fine 100% 100% 
b. Gross 100% 100% 
5.0 Visual Inspection 100% 100% 
6.0 Pre Bum-in In accordance with Actel 100% N/A 
Electrical Parameters applicable device specifications 
7.0 Bum-in Test 1015 Condition D 100% N/A 
160 hours @ 125°C Min. 
8.0 Interim (post burn-in) In accordance with Actel 100% 100% 
Electrical Parameters applicable device specifications (es final test) 
9.0 Percent Defective Allowable 5% All Lots N/A 
10.0 Final Electrical Test In accordance with Actel 
applicable device specifications 
a. Static Tests 100% 100% 
(1) 25°C 
(Subgroup 1, Table I, 5005) 
(2) -55°C and + 125°C. 
(Subgroups 2, 3, Table |, 5005) 
b. Dynamic and Functional Tests 100% 100% 


(1) 25°C 
(Subgroup 7, Table І, 5005) 
(2) -55°C and + 125°C. 
(Subgroups 8A and 8B, Table |, 5005) 


c. Switching Tests at 25^C 10096 100% 
(Subgroup 9, Table 1, 5005) 





11.0 Qualification or Quality 5005 All Lots N/A 
Conformance Inspection Test 
Sample Selection (Group A) 


12.0 External Visual 2009 100% Actel specification 
етй: ee en 














Device Resources 






User I/Os 


Device Modules Gates] 44 ЈО 68 Ја 84JQ 8400 PG 


1200 34 57 МА М/А 57 
2000 34 57 69 69 69 


А1010 295 
А1020 547 








Absolute Махітит Ratings 
Free alr temperature range 


Symbol Parameter Limits Units 
Vee ОС Supply Voltage! -0.5to +7.0 Volts 
м Input Voltage -0.5t0 Vcc +05 Volts 
Vo Output Voltage -0.5 to Voc +0.5 Volts 























Ik Input Clamp Current +20 mA 
lox Output Clamp Current +20 mA 
lox Continuous Output Current +25 mA 





Tstg Storage Temperature -65 to + 150 °C 





Stresses beyond those listed under “Absolute Maximum Ratings” may 
cause permanent damage to the device. Exposure to absolute maximum 
rated conditions for extended periods may affect device reliability. Device 
should not be operated outside the Recommended Operating Conditions. 


Note: 
1. Vpp = Vcc, except during device programming. 


DC Characteristics 
Мс = 5.0 V + 10%; -55 °C < Tc < +125 °C 


Recommended Operating Conditions 














Parameter Milltary Units 

Temperature -55 to +125 °C 

Range (Tc) 

Power Supply 

Tolerance +10 %\Усс 
Note: 


Tc = Case Temperature 
Power Dissipation 


The following formula is used to calculate total device dissipation. 
Total Chip Power (mW) = 0.41 N*F1 + 0.17 M*F2 + 1.62 P*F3 
Where: 

F1 = Average logic module switching rate in MHz. 

F2 = Average clock pin switching rate in MHz. 

ЕЗ = Average I/O switching rate in MHz. 

M = Number of logic modules connected to the clock pin. 


N = Total number of logic modules used on the chip. 
(including M) 


P = Number of outputs used loaded with 50 pF 


The second term, variables F2 and M, may be ignored if the 
CLKBUF macro is not used in the design. 


Symbol Parameter Test Conditions FF PRA Units 
Subgroups 
v 


VoL Output Low Voltage 


Мос = 4.5 V, lg, = 4 MA 
Test one output at a time 


Vcc = 4.5 V, lon = -3.2 тА 


1,2,3 0.4 


Мон Output High Voltage Test one output at a time 1,2,3 3.7 v 
VL Input Low Level 1,2,3 -0.3 0.8 у 
Мн Input High Level 1,2,3 20 +8 V 
lee Standby Supply Current ua Ed Voc or GND 1,2,3 5 тА 
IL Input Leakage Current Veco = 5.5 V, Vin = Vcc or GND 1,2,3 -10 10 НА 
loz Output Leakage Current Усс = 5.5 V, Vour = Vcc or GND 1,2,3 -10 10 HA 
los Output Short Circuit Current 
Vout = Veo Test one output at a time 1,2,3 20 140 mA 
Vour = GND Усс = 4.5 V for min. limit -10 -100 mA 


Усс = 5.5 V for max. limit 





Switching Characteristics 
Voc = 5.0 V + 10%: -55 °C < Tc < +125 °C 





Symbol Parameter Test Conditions Group A Units 
Subgroups 
tepB Binning Circuit Delay 
A1010 Voc = 45V 9, 10, 11 120 ns 
A1020 Vin = ЗММ = OV 186 ns 


Уол = 15V 





1-56 





Functional and Switching Tests 


A1010 and A1020 devices can be tested functionally by using a 
serial scan test method. Data is shifted into the SDI pin, and the 
DCLK pin is used as a clock. The data is used to drive the inputs of 
the internal logic and I/O modules, allowing a complete functional 
test to be performed. The outputs of the modules can be read by 
shifting out the output response. All units are tested for 
functionality over the military temperature range. Group A 
subgroups 7, 8A, and 8B are tested in the same manner. 


AC timing for logic module internal delays is determined after 
place and route. The ALS Timer utility displays actual timing 
parameters for circuit delays. ACT 1 devices are AC tested toa 
“binning” circuit specification. 


Max. junction temp. (°C) - Max. military temp. (°C) 





ACT 1 Military FPGAs 


The circuit consists of one input buffer + n logic modules + one 
output buffer (n = 16 for the A1010; n = 28 for the A1020). The 
logic modules are distributed along two sides of the device, as 
inverting or non-inverting buffers. The modules are connected 
through programmed antifuses with typical capacitive loading. 


Package Thermal Characteristics 


The device junction to case thermal characteristic is 06, and the 
junction to ambient air characteristic is Өја. The thermal 
characteristics for Өја are shown with two different air flow rates. 


Maximum junction temperature is 150°C. 


A sample calculation of the maximum power dissipation for an 
84-pin ceramic pin grid array at military temperature is as follows: 


150°C - 125°C 











: = ——— = 0.76 Ww 
0ja (°C/W) 33°C/W) 
Package Type Pin Count Oe ва 200 tin, “nike 
Ceramic Pin Grid Array 84 8 33 20 *CN 
Cerquad J Leaded Chip Carrier 44 8 40 32 *C/N 
68 8 38 30 °см 
84 8 36 25 °C/W 
Output Buffer Performance Derating 
Sink Source 

10 10 

8 8 

3 š ° 

4 4 

2 2 

02 0.3 0.4 0.5 08 40 38 3.2 2.8 2.4 2.0 
Мо. (Volts) Мон (Volts) 


Worst-case values at 125°С, 4.5 V. 





Note: 


The above curves are based on characterizations of sample devices and are 
not completely tested on all devices. 
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Timing Characteristics and critical (speed-sensitive) nets are given below. Most nets will 
fall into the “typical” category. 


Less than 1% of all routing in a design requires the use of “long 
tracks.” Long tracks, long vertical or horizontal routing paths, are 
used by the autorouter only as needed. Delays due to the use oflong 
tracks range from 15 ns to 35 ns. Long tracks may be used to route 
the least-critical nets in a given design. 


Timing is design-dependent; actual delay values are determined 
after place and route ofthe design using the ALS Timer utility. The 
following delay values use statistical estimates for wiring delays 
based on 85% to 90% module utilization. Device utilization above 
95% will result in performance degradation. 


With ALS place and route programs, the user can assign critically 
level to a net, based on timing requirements. Delays for both typical 


Module Timing 
Voc = 5.0 V; Ta = 25 °C; Process = Worst Case 





Single Logic Module Macros 
(e.g., moat gates, latches, multiplexors)' 











Parameter Output Net FO=1 РО = 2 РО = 3 ҒО = 4 FO = Units 
tep Critical 70 75 8.0 11.0 Note 2 ns 
{рь Турїса! 8.2 87 10.0 11.2 14.0 ns 





Dual Logic Module Macros 
(e.g., adders, wide Input gates)' 








Parameter Output Net FO =1 FO = 2 FO =3 FO = 4 FO = 8 Unite 
teo Critical 12.0 125 13.0 16.0 Note 2 ns 
tep Typical 13.2 137 15.0 16.2 19.0 ns 





Sequential Element Timing Characteristics 











Parameter Unite 
tsu Set Up Time, Data Latches 46 5.0 5.4 5.8 5.8 ns 
tsu Set Up Time, Flip-Flope 5.0 5.0 5.0 5.0 5.0 ns 
їн Hold Time 0 0 0 0 0 пз 
tw Pulse Width, Minimum? 10.0 10.0 10.0 10.0 10.0 ns 
tco Delay, Critical Net 70 75 8.0 11.0 Note 2 ns 
tco Delay, Typical Net 82 87 10.0 11.2 14.0 ns 





1. Most flip-flops exhibit single module delays. 
2. Critical nets have a maximum fan-out of 6. 
3. Minimum pulse width, tw, applies to CLK, PRE, and CLR inputs. 
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ЏО Buffer Timing 
Veco = 5.0 V; Ta = 25 °С 
INBUF Macros 
Parameter From - To FO=1 FO = 2 FO =3 FO=4 РО = 8 Units 
teu Pad to Y 90 99 11.6 13.9 18.6 ns 
їын Pad to Y 77 8.4 10.0 10.9 16.1 ns 





CLKBUF (High Fan-out Clock Buffer) Macros 














Parameter FO = 20 FO = 50 FO = 200 Units 
א‎ 11 15 25 ns 
н. 12 16 25 ns 
Notes: 
1. A clock balancing feature is provided to minimize clock skew. 2. There is no limit to the number of loads that may be connected to the 
CLKBUF macro. 


OUTBUF, TRIBUFF & BIBUF Macros 








C = 50 pF 
Parameter From - To CMOS TTL Units 
teu D to Pad 5.1 6.4 ns 
tPLH D to Pad 9.4 7.4 ns 
нг E to Pad 67 44 ns 
tezH E to Pad 8.4 6.3 ns 
teiz E to Pad 8.9 67 ns 
tez E to Pad 6.4 7.7 ns 


Change In Propagation Delay with Load Capacitance 











Parameter From - To CMOS TTL Units 
teHL D to Pad 0.04 0.06 ns/pF 
tPLH D to Pad 0.09 0.05 ns/pF 
teHz E to Pad 0.10 0.06 ns/pF 
{рун E to Pad 0.09 0.05 ns/pF 
tez E to Pad 0.09 0.05 ns/pF 
tez. E to Pad 0.04 0.05 ns/pF 

Notes: 
1. The BIBUF macroinput section exhibits the same delaysas the INBUF 2. Load capacitance delay delta can be extrapolated down to 15 pF 
macro. minimum. 
Example: 


Delay for OUTBUF driving a 100-pF TTL load: 
tpuL = 6.4 + (0.06 * (100-50)) = 6.4 + 3.0 = 9.4 ns 
tpl = 7.4 + (0.05 * (100-50)) = 7.4 + 2.5 = 9.9 ns 





- 
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Timing Derating The derating factors shown below are based on the recommended 

operating conditions for ACT 1 military applications. The derating 
Operating temperature, operating voltage, and device processing curves below show worst-to-best case operating voltage range and 
conditions account for variations in array timing characteristics. best-to-worst case operating temperature range. 


These variations are summarized into a derating factor for ACT 1 
array typical timing specifications. 


Timing Derating Factor (xtypical) 
Best-case 0.40 


Worst-case 1.38 


Note: 

Best case reflects maximum operating voltage, minimum operating 
temperature, and best case processing. Worst case reflects minimum 
operating voltage, maximum operating temperature, and worst case 
processing. Best case derating is based on sample data only and is not 
guaranteed. 


Voltage Derating Curve Temperature Derating Curve 


Factor 





45 45 50 525 55 во -40 -20 O 20 40 80 80 100 120 
Voc (Volts) Temperature (°C) 





Output Buffer Delays 


8 To AC test loads (shown below) 
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AC Test Loads 
Load 1 Load 2 
(Used to measure propagation delay) (Used to measure rising/falling edges) 
Voc GND 
To the output under test ө ө 
50 pF R to Voc for teiz/tez, 
| R to GND for 12/1527 
== To the output under test R=1kn 


a 





Input Buffer Delays Module Delays 
s Fan-out = 2 
Fan-out = 2 А x 
B 
Y 


PAD 








teuH н. 





D-Type Flip-Flop and Clock Delays 


tsu = Set up time 
їн = Hold time (0 ns) 
tw = Minimum pulse width (10 ns) 











Package Pin Assignments 











(Top View) 
1 2 
ajo O 
воо 
соо 
pO כ‎ 
E||O O 
F||O О 
соо 
ноо 
Цоо 
кооооооооооо 
ı | OO000000000 
@ Orientation Pin (C3) 
Signal A1010 A1020 
PRA A11 A11 
PRB B10 B10 
MODE E11 E11 
SDI B11 B11 
DCLK C10 C10 
Vpp K2 K2 
CLK or I/O F9 F9 
GND B7, E2, E3, K5, F10, G10 B7, E2, E3, K5, F10, G10 
Усс B5, F1, G2, K7, E9, E10 B5, F1, G2, K7, E9, E10 
N/C (No Connection) B1, B2, C1, C2, K1, J2, L1, J10, K10, K11, B2 
C11, D10, D11 
Notes: 
1. Vpp must be terminated to Vcc, except during device programming. 3. Unused I/O pins are designated as outputs by the Action Logic System 
2. MODE must be terminated to circuit ground, except during device and are driven low. 
programming or debugging. 4. All unassigned pins are available for use as I/Os. 
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Package Pin Assignments (continued) 








72 


וד 
70 
69 
es‏ 
7 
6 
65 
4 
83 
62 
ot‏ 
60 
59 
56 
57 
56 
55 
54 





Notes: 
1. Vpp must be terminated to Vcc, except during device programming. 3. Unused I/O pins are designated as outputs by the Action Logic System 
2, MODE must be terminated to circuit ground, except during device and are driven low. 

programming or debugging. 4. All unassigned pins are available for use as I/Os. 








Package Pin Assignments (continued) 





Pin #1 > 
Index 84 83 82 B1 80 79 78 77 78 75 74 73 
ҮҮ 
МС AMET | 
| | 
з] 
4 [—ÉLLÁ— el 
5 ב‎ 


Notes: 
1. Vpp must be terminated to Vcc, except during device programming. 


2. MODE must be terminated to circuit ground, except during device 
programming or debugging. 


[MSN 


[E — ——23 $ PRB or I/0 


[E — —2323 GND 
Ip а 
2-9 
] + 
כ‎ — 
8 
= ————38 


OL 83 PRA or 1/0 
[LZ 62 DCLK or ио 
[E — ———3 €! SDI or 1/0 
60) 
DZI 59 
(0 % 
1 57 Vop 
[U J 59 Voo 
[LL 55 MODE 
הי‎ БЕ 
1 53 CLK or I/O 
Пе 52 
IU C 5⁄1 
П] 50 GND 

[ZV —— 46 GND 

] 4 


Vo Ë 


3. Unused I/O pins are designated as outputs by the Action Logic System 
and are driven low. 


4. All unassigned pins are available for use as I/Os. 
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Package Mechanical Details 
Ceramic Pin Grid Array 


.050" + .010” 
(4 ples) 
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@ Orientation Pin 
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Package Mechanical Details (contlnued) 
J-Leaded Cerquad Chip Carrler 
















TNT 






t| 
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+ 
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fr 


.105" + 010" IE 


.155” 
.180” 





Lead Count DE D1, EI 
.690" + .005” .650" + 008” 


.990" + .005" .950" + .008" 
1.190" + .005" 1.150" + .008" 
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Package Mechanical Details (continued) 
Ceramic Quad Flatpack 





0.058” + .010” 
Pin #1 
Index 
0.085” + .010” 
.010” + .002” 8 
.025” - _ 
.050" + .010" 
0.350” + .025" 
0.006" + .002” 
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Achte 


Action Logic" System 
FPGA Design Environment 





Features 


e Extensive 
Gate Array 
Macro Library 


e Supports Popular 
CAE Systems 


e Design Validation, 
Electrical Rules Check 


e Fully Automatic 
Place and Route 





e Menu-Driven Pin Editor 
for I/O Assignment 


e Timing Analyzer Inspects All Critical 
Paths and AC Specifications 
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Overview 


The Action Logic"! System is a high-productivity CAE 
environment for implementing logic functions with ACTTM 1 field 
programmable gate array devices. A menu-driven interface allows 
users to complete ACT 1 designs, from concept to silicon, within 
hours without costly NRE. The system includes a software design 
environment and an Activator™ programmer, tester, and 
debugger. Most popular CAE platforms are supported. Designers 
use their workstation to capture schematics, simulate, verify, place 
and route, perform timing analysis, program, and debug the chip. 
Figures 1 through 4 show the Actel interface to popular CAE 
design environments. Actel’s on-line help screens and reference 
manuals speed and simplify the design process. 


Gate Array Macro Library 


The Actel macro library contains the logic function building blocks 
necessary to create a design. The library includes macros ranging in 
complexity from simple gates to complex functions. Each macro 
has a graphic symbol or icon. Hard macros also contain placement 
information, a netlist, and timing data. 


Basic gates from the library are used to create soft macros, such as 
counters, adders, and decoders. The Actel library contains over 200 
different macros. 


Schematic Capture and Simulation 


Users enter their schematic using the Actel library. When 
schematic capture is completed, functional simulation is performed 
on the design. 


Pin Editor 


The Actel Pin Editor is an easy-to-use, menu-driven program for 
user assignment of logic I/Os and package pins. The editor 
automatically scrolls a list of all user-designated I/Os. To assign any 
pin to an I/O, the user enters the desired pin number next to the 
I/O node name. During each pin assignment, the Editor insures 
each pin is a valid package pin and is not already assigned to 
another chip I/O. 


Automatic I/O pin assignment is also available. 


Design Validation 


The Actel Design Validator examines an ACT design for 
adherence to design rules specific to the ACT device chosen for the 
design. Validation verifies routability and performs design rules 
checks prior to routing. The Validator produces error and 
information messages and system warnings regarding electrical 
rules violations such as excessive fanout, shorted outputs, or 
unconnected inputs. A warning is issued, for example, if a net 
exceeds a fanout of ten; unconnected module inputs produce an 
error message. 
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The Validator also provides statistical information about 
routability, logic module count, I/O count, average fanout per net, 
and array utilization. After passing through the validator, the 
design is free of electrical rules violations. 


Automatic Place and Route 


Fully automatic place and route software minimizes design delay by 
assigning macros to optimal locations in the chip. The system uses 
the design’s netlist, critical net information, and I/O assignments to 
automatically place and route all the logic blocks within the circuit. 
No manual intervention is required, even at high device utilization. 


The software provides the user with data on actual wire lengths, 
capacitive loading, and wiring congestion. The route program 
assigns the shortest possible net segments to connect the library 
macros with minimal delay, routing 100% of the nets automatically 
for 85% to 95% logic module utilization. 


Timing Analysis 


The timing analyzer (Timer) is an interactive tool that determines 
and highlights all critical and non-critical paths within a specified 
design. After the layout phase is completed, the Timer extracts 
accurate net delays. These delays are back-annotated to the netlist 


and evaluated with the timer or a CAE simulator. Using this 
information, the designer can optimize the design to meet timing 
specifications. The Timer accepts as input the design’s netlist and 
delay information. The user directs the type of analysis and output. 
Delay reports generated by the timing analyzer using post-route 
numbers provide the final AC specifications for the design. 


Device Programming and Functional Test 


Programming is controlled by the Activator programming system. 
Action Logic software generates a fuse map for the ACT device, 
which is used by the Activator for programming. A series of 
internal address registers are automatically loaded; they specify the 
programming element (PLICE™ antifuse) to be programmed. A 
programming sequence is then initiated, creating a permanent link. 
These steps are repeated until all interconnections are made. 


The Activator supports functional testing using an I/O test vector 
file. It also supports interactive circuit debugging. 


In-Circuit Test and Debug 


Once the device is programmed and functionally verified, it is ready 
for operation in the user's system. Actionprobe™ diagnostic tools 
may then be used to further evaluate circuit integrity. 





Action Logic System (ALS) 


Ordering Information 


Configuration 














Part Platform/ Viewiogic® Place & Route, Debugger, Activator 1 
Number CAE Host” Viewdraw® Timer Actionprobes Programmer 
ALS-113 386™ PC/ 
OrCAD™ x x x 
Viewlogic 
ALS-110 386 PC/ 
OrCAD x x x x 
Viewlogic 
ALS-031 Apollo™/ 
Mentor Graphics® x x x 
ALS-035 Apollo/ 
Mentor Graphics x 
ALS-041-S4 Sun-4™/ 
Valid x x x 
ALS-045-S4 Sun-4/ 
Valid x 
ALS-051-S4 Sun-4/ 
Viewlogic X X X 
ALS-055-S4 Sun-4/ 
Viewlogic x 
OPTIONS | 
6 High-Density Viewsim® for 386 PC 
ALS-017 Low-Density (up to 650 logic modules) Viewsim for 386 PC 
ALS-114, 134, 144-S4, 154-S4 ALES 1 (Boolean Entry and Optimization) (386 PC, Mentor Graphics, Sun-4) 
ALS-119 Activator 1 Programmer 
ALS-219, 239, 249-S4, 259-S4 Activator 2 Programmer (386 PC, Mentor Graphics, Sun-4) 
ALS-256-S4 Sun-4-based Workview® CAE 





“Includes schematic, symbol, and functional simulation libraries. 


Important: All systems and options are subject to availability. 
Please contact your Actel representative for current 
status. 
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Viewlogic Design Environment 


Viewdraw®: Viewlogic Netlist Unit Delay Post-Route 


Design Entry תא‎ Functional Simulation Simulation 


makeadl: Export 
Netlist Conversion Backannotation 


Validator: Place & Timer: Timing 
Design Validation Route Analysis 


Activator: Debugger & 
Device Actionprobe 


Programmer 
ALS Design Environment w ë 


Your design in an 
ACT FPGA 





Figure 1. ALS - Viewlogic Design Flow 


Mentor Graphics Design Environment 


NETED™ & SYMED™ Unit Delay Post-Route 
Design Entry Functional Simulation Simulation 


Export 
Makeadl: Backannotation 
Netlist Conversion 


Timer: Timing 
Analysis 


Validator: | Activator: Debugger & 
Design Validation Device Actionprobe 
Programmer 


ALS Design Environment 


Your design in an 
ACT FPGA 





Figure 2. ALS - Mentor Graphics Design Flow 
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OrCAD Design Environment 


OrCAD/SDT™: OrCAD/VST™: 
Design Entry Functional Simulation 
= OrCAD Netlist 4 


 orca2adi: | 


Netlist Conversion 


Timer: Timing 
Analysis 


Validator: Activator: Debugger & 
Design Validation Device Actionprobe 
00 


Your design оп 
an ACT FPGA 


ALS Design Environment 





Figure 3. ALS - OrCAD Design Flow 
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Figure 4. ALS - Valld Design Flow 
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Activator™ 2 


Programmer/Tester/Debugger 





Features 


e Supports ACTTM 1 and ACT 2 Device Families 


e SCSI Connectors Interface to 386™ PC, Sun™ and 

Apollo™ Workstations 

Adapter Modules for Each Package/Family Combination 

Up to Two Times Faster Than Activator™ 1 for ACT 1 

Programming 

e Supports Functional Verification with Actionprobe™ 
Diagnostic Pod 

e Simultaneously Programs a Single Pattern in Up To Four 
Identical Devices 

e In-Circuit Probing of Up To Four Devices Simultaneously 


Product Description 


Activator 2 is Actel’s state-of-the-art desktop programmer. It 
utilizes Actel’s Action Logic™ System (ALS) software and 
PLICE™ antifuse technology to program custom-engineered 
devices from Actel’s ACT 1 and ACT 2 device families. SCSI 
connectors, shipped with the unit, provide a convenient connection 
for PC 386, Sun, and Apollo workstation support. Customized 
adapter modules for each device type permit easy interchange of 
programmable devices. Programming, Test, and Debug execute up 
to two times faster than on the Activator 1; users can program up to 
four devices at one time. The accompanying diagnostic pod and 
ALS debug software support observation of all internal signals. 


The unit is software-driven, providing flexibility of application and 
a built-in barrier to obsolescence. Independently powered, the unit 
provides desktop device programming, functional test, and 
in-circuit debug. 


Preliminary 
Product 
Brief 


The unit supports different CAE platforms using SCSI connectors. 


The Activator 2 may operate with four different adapter modules 
inserted, or three of the sockets may be slaves to the first, 
permitting simultaneous programming of four identical devices. 
The Activator 2 then addresses any one of the sockets for 
programming. 


Activator 2 Base Unit 


The base unit contains the control board and the analog board. 
The LED display on the top of the programmer shows when the 
unit receives power. Four adapter ports located on the top of the 
base unit accept the different adapter modules for each device type. 
The adapter ports are identical, allowing interchangeability of 
modules. SCSI connectors are located on the back of the unit. 


The Adapter Module 


The adapter modules customize pin configurations for each device; 
the user need only switch adapter modules in order to program 
another device. All four ports may be in use simultaneously, when 
programming identical devices. The adapter modules are compact 
and sturdy. Any module may be used on any available port. 


The Diagnostic Pod 


The diagnostic pod supports ALS diagnostics and connects to the 
programmer via cable. The Activator permits the user to view any 
internal circuit activity through Actel’s on-chip diagnostic ports. 
Up to four pods may be used simultaneously. A six-foot cord 
connects the pod to the base unit and provides debugging 
flexibility. 
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ACT™ 1010/1020 


By Steve Chiang 


Reliability Report and 





ACT™ 1010/1020 devices are 1200- and 2000-gate (respectively) 
field programmable gate arrays (FPGAs). The programming 
element is an Actel-invented PLICE™ (Programmable 
Low-Impedance Circuit Element) antifuse. An antifuse is a 
normally open device in which an electrical connection is 
established by the application of a programming voltage. Although 
ACT 1010/1020 products are one-time programmable devices, 
their unique architecture features complete functional testability. 


The ACT 1010/1020 device is processed using a standard 2 um, 
double metal, CMOS process to which three additional masking 
steps have been added to implement the PLICE antifuse. A 
description of the main process parameters is shown in Table 1. 
Because ACT 1010/1020 devices are manufactured with a 
conventional CMOS process, normal CMOS failure modes will be 
observed. However, the addition of the antifuse adds another 
structure that could affect the device’s reliability. 


Actel has completed numerous studies in order to quantify the 
reliability of the antifuse. These studies lead to the conclusion that 
the time to failure of the antifuse is substantially more than 40 years 
under normal operating conditions and that the combined 
contribution of all antifuses to the gate array product’s hard failure 
rate is less than 10 FITS (Failures-in-Time or 0.001% failures per 
1000 hours). 


Table 1. ACT 1010/1020 Process Description 
CMOS, 2 um, double metal, dual polysilicon, dual well, EPI wafer. 











Dimensions 

Width Space 
N+ 4.0 um 2.0 um 
P+ 40 2.0 
Cell Polysilicon 1.6 3.6 
Gate Polysilicon 1.6 2.4 
Metal I 4.0 2.0 
Metai Il 4.2 28 
Contact 18x18 2.0 
Via 2.0x 2.0 2.0 
Thickness 
Normal Gate Oxide 25 nm 
High Voltage Gate Oxide 40 
Cell Polysilicon 35 
Gate Polysilicon 40 
Metal I 80 
Metal 11 100 
Passivation 1100 
Composition 
Metal I 98% Al, 1% Si, 1% Cu 
Metal Il 98% Al, 1% Si, 1% Cu 
Passivation 300 nm 5102, 800 nm SiN 
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PLICE Antifuse Reliability 


The antifuse is a vertical, two-terminal structure. It consists of a 
polysilicon layer on top, N+ doped silicon on the bottom, and an 
ONO (oxide-nitride-oxide) dielectric layer in-between. A Scanning 
Electron Microscope (SEM) cross-section of the antifuse is shown 
in Figure 1. On the ACT 1010/1020 device, the size of the antifuse is 
18 um?. This small size, along with a low programmed 
on-resistance, typically 500 Q, makes the PLICE antifuse a very 
attractive alternative to EPROM, EEPROM, or RAM for use asa 
programming element in a large programmable gate array. In the 
unprogrammed state, the resistance of the antifuse is in excess of 
100 Mà. The ACT 1010 and ACT 1020 contain 112,000 and 
186,000 antifuses respectively. However, typical applications 
utilizing 85% of the available gates require programming only 2% 
to 3% of the available antifuses. 































































































Antifuse Poly- 


Dielectric < 






















































































































































































































































































Figure 1. SEM Cross-Section of Antifuse 





The Unprogrammed Antifuse 


In order to evaluate antifuse reliability, Actel has developed 
models and collected data for both unprogrammed and 
programmed antifuses! ?. We'll consider the unprogrammed 
antifuse first. Since the antifuse is a dielectric sandwiched between 
polysilicon and silicon, the model for its reliability, in the 
unprogrammed condition, is the same as that used to evaluate 
reliability of MOS transistor gate oxides. The parameter we wish to 
evaluate is the time to breakdown (tsa) of the dielectric. This 
parameter is a function of the electric field across the dielectric as 
well as temperature and has the following relationship?. 
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tsa = to - exp (G/E) (1) 


where tsa is the time їо breakdown іп seconds, to is а constant, E is 
the electric field in MV/cm, and G is the field acceleration factor in 
MV/cm (G is a function of temperature). 


By taking the log of both sides of equation 1 we have: 
In (tea) = G - (VE) + In (to) (2) 


From experimental data, we can plot the log of the time to 
breakdown of the antifuse at various temperatures versus the 
reciprocal of the electric field across it and derive G from the slope 
and t; from the Y axis intercept. Actel has done this both on large 
antifuse capacitors and on arrays of 28,000 antifuses. An example is 
shown in Figure 2. From this we have derived a value for G of 510 
MV/cm and a to of 1 x 10726 seconds. Also note the difference 
between the data at 25°C and 150°C. 











LN (Time to breakdown in seconds) 


78 78 8 8.2 
100/E (cm/MV) 





8.4 8.6 68 9 


Figure 2. Fleld Acceleration of Antifuse (0.04 mm? Area Capacitor) 





In order to quantify the temperature-dependence of the time to 
breakdown, we use the Arrhenius equation to determine the 
reaction rate (or semiconductor failure rate) of a given process 
(failure mode) over temperature: 


R = Ros exp E” (3) 


where R is the reaction rate, Ro is a constant for a particular 
process, T is the absolute temperature in degrees Kelvin, k is 
Boltzmann’s constant (8.62 x 10° eV/°K), and E, is the activation 
energy for the process in electron volts. To determine the 
acceleration factor for a given failure mode at temperature T; as 
compared with temperature T, we use equation 3 to derive: 


A (Т Т) = exp ]- (E,/k) - (VT1) - (UT2)}] (4) 
where A is the acceleration factor. 

For a given time to breakdown of a dielectric, the expression, 
E, = k. din (уд (UT) (5) 


gives us the activation energy’. The field acceleration factor, G, is 
also temperature-dependent, i.e. 








G(T) = G . [1 + 8k. (VT - 1/298([ (6) 


where 8 (in eV) characterizes the temperature-dependence of G. 
E, can be related to G (T) by: 


E, = б. 5/Е – E, @ 


where 8 and E, are treated as fitting parameters between E, 
and G. 


From the data shown in Figure 2, as well as data on test arrays of 
28,000 antifuses, we have derived a value for E, of 0.2eV. This value 
of E, = 0.26% is for a very high E field of 11 MV/cm, or 10 V across 
the antifuse. With 5.5 V, the E field is about 6 MV/cm and E, is 
approximately 0.6 eV. Values of & and E, were found from 
equation 7 to be 0.01 eV and 0.24 eV, respectively. 


By combining equations 1, 5, 6, and 7, we obtain an overall equation 
for the time to breakdown for a given temperature and E field: 


tog = to + exp ((G/E) [1 + (5%) « (UT - 1/298)] - (БЫК). 
(UT - 1/298([ (8) 











By applying the values for the constants as defined above, the time 
to breakdown for the antifuse dielectric can be derived for a given 
temperature and electric field. In Table 2, we have used equation 8 
to solve for the acceleration factors associated with powering up a 
device at high voltage and/or temperature and comparing the 
failure rate with more typical voltages or temperatures. We can see 
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factor of 36, or 4.1 equivalent years for a 1000-hour burn-in at 
125°C. Note that this acceleration factor of 36 is close to the value 
of 41.8 derived from the Arrhenius equation (equation 4) using an 
activation energy of 0.6 eV and the same temperatures. We use 0.6 
eV as a general semiconductor failure mode activation energy 
when calculating failure rates from high-temperature operating life 


the effect of temperature by comparing 125°C at 5.5 V with 55°C at (HTOL) later in this report. 


5.5 V in which the Actel model (equation 8) gives us an acceleration 


Table 2. Acceleration Factor vs. Operating Conditions (Unprogrammed Antifuse) 
to = 1х 10-16%. G = 510 MV/cm, 5 => eV. 











Temperature/Voltage Equivalent Years 

Acceleration for 1000-Hour 

Model High Typical Factor 125°C Burn-In 
125°С/5.5 V 55°C/5.5 V 36.0 4.1 
Fixed Voltage 125°C/5.5 V 95°C/5.5 V 3.9 0.4 
25°С/5.5 V 25*C/5.25 V 48.7 5.6 
Fixed Temperature 25?C/5.75V 25*C/5.25 V 1692 193.2 
25°C/5.75 V 25° C/5.5 V 347 4.0 
125°C/5.5 V 55°C/5.25 V 1526 174.2 
Varied Ternperature and Voltage 125°C/5.75 V 55°С/5.5 V 883 100.8 
125°С/5.75 V 95°С/5.5 V 96.5 11.0 
Fixed 0.6 eV 125°C/5.5 V 55°C/5.5 V 41.8 4.8 
(Activation Energy), 125°C/5.5 V 95°С/5.5 V 4.2 0.5 

Voltage-Independent 





We can also see from Table 2 that a small change in voltage is a 
much more effective reliability screen for the unprogrammed 
antifuse than is a change in temperature. For example, if we 
compare 25°C at 5.75 V to 25°C at 5.25 V we see that just a half volt 
change yields an acceleration factor of 1692, or 193.2 equivalent 
years per 1000 hours at 5.75 V. This strong dependence on voltage 
allows Actel to screen out antifuse infant mortality failures during 
normal wafer sort testing at Actel simply by performing a special 
test in which a higher than normal voltage is applied across all 
antifuses. Because antifuse infant mortality failures can be detected 
and effectively screened, ACT 1010/1020 devices have as high a 
level of reliability as standard CMOS processed products. 


Actel has collected data on over 350 antifuse test devices 
representing eleven wafer runs. From this data, we have 
determined that the contribution of the antifuse to overall device 
reliability is less than 10 FITS. This conclusion is confirmed by the 
reliability data taken on actual ACT 1010/1020 units which will be 
discussed later in this paper. 


The Programmed Antifuse 


A Kelvin test structure as shown in Figure 3 was used to evaluate 
reliability of a programmed antifuse. Here, a strip of polysilicon 
crosses an N+ diffusion. The antifuse is located at their 
intersection. There are metal-to-poly contacts at nodes 1 and 3 as 
well as metal-to-N+ contacts at nodes 2 and 4. A four-terminal 
Kelvin structure is useful should a failure occur, because antifuse 
opens can be separated from other problems (such as polysilicon or 
contact opens) simply by checking for continuity on appropriate 
pairs of nodes. 


Test devices were stressed by forcing a constant 5 mA current from 
polysilicon to N+diffusion through the antifuse at 250°C. Note 
that this stress is far greater than what a programmed antifuse 
would see in a device operating under normal conditions. Because 
the antifuse is used to connect two networks together, there is 
usually no voltage across it, hence no current passes through. A 
voltage will appear across the antifuse only momentarily while a 
network switches from low-to-high or high-to-low. 





N+ 
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— 
GND 


+V 
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Figure 3. Antifuse Kelvin Structure 
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During the 5 mA, 250°C stress, the voltage across the antifuse was 
monitored. Figure 4 is a plot of the monitored voltage as a function 
of stress time. A sudden increase in voltage indicates that an open 
occurred. As can be seen from the figure, failures occurred at about 
300 hours of stress. However, by probing on nodes 3 and 4 of the 
Kelvin structure, we were able to measure continuity and 
determine that the cause of failure was not the antifuse. The failed 





Antifuse Accelerated Lifetest 
250°C 5 mA Test 


units were then examined on an SEM, where the cause of failure 
was revealed as metal-to-poly contact electromigration. This is a 
well-known failure mode in CMOS, which has been determined to 
have an activation energy of 0.9 eV. Using equation 4 we can predict 
a lifetime under normal operating conditions in excess of 40 years 
for this failure mode. The lifetime of the programmed antifuse is 
even longer. 
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Figure 4. Voltage Across Antifuse versus Stress Time 





ACT 1010/1020 Device Reliability 


Device reliability was evaluated on four Actel products: a 64K 
PROM (PROM64), a 300-gate FPGA (1003), a 1200-gate FPGA 
(ACT 1010/1010A), and a 2000-gate FPGA (ACT 1020/10204). 
The PROM64 product uses the same process and antifuse as the 
ACT 1010/1020. The 1003 is a test device, a smaller version of the 
ACT 1010/1020, which was used for early characterization and 





qualification. The ACT 1010A and ACT 1020A are 20% linear die 
shrink versions of the ACT 1010 and ACT 1020, respectively. The 
PROM64 units were packaged іп 24-pin side-brazed packages; the 
FPGA units were in 68- and 84-pin JLCC (ceramic J-leaded chip 
carriers) and PLCC (plastic-leaded chip carriers) packages. 
Package characteristics for ACT 1010/1020 devices are shown in 
Table 3. 





Table 3. ACT 1010/1020 Package Characteristics 
PLCC 
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Molding Compound 

Filler Material 

Lead Frame Material 
Lead Plating Composition 
Die Attach Material 


Sumitomo 6300 H 

Fused silicon 70% by weight 

Copper 

Tin or solder, 300 to 800 micro inches (uin) 
Silver Epoxy 











Die Coat Dow Corning 014939 silicon gel 

Bond Wire Gold, 1.3 mil diameter 

Bond Attach Method Thermosonic 

JLCC 

Body Material Ceramic 

Lid and Lead Material Alloy 42 with minimum 60-uin gold plate 

Bond Wire 99% Aluminum, 1% Si, 1.25 mil diameter 

Bond Attach Method Ultrasonic 

Thermal Resistance (°C/Watt) 

Package 44 PLCC 44 JLCC 68 PLCC 68 JLCC 84 PLCC 84 JLCC 84 PGA 
Өс 15 5 13 5 12 5 8 
Oja 52 38 45 35 44 34 35 





High Temperature Operating Life (HTOL) Test 


The intent of HTOL is to dynamically operate a device at high 
temperature (usually 125°C) and extrapolate the failure rate to 
typical operating conditions. This test is defined by Military 
Standard-883 in the Group C Quality Conformance Tests. The 
Arrhenius relationship in equations 3 and 4 is used to do the 
extrapolation. To use the Arrhenius equation, we need to know the 
activation energy of the failure mode. Activation energies of 
antifuse failure modes were discussed earlier. Table 4 gives the 
activation energies of general semiconductor failure modes. 


Table 4. Activation Energy of CMOS Fallures 


Fallure Mechanism 





Activation Energy 





lonic Contamination 1.0 eV 
Oxide Defects 0.3 eV 
Hot Carrier Trapping in Oxide -0.06 eV 
(Short Channels) 

Silicon Defects 0.5 eV 
Aluminum Electromigration 0.5 eV 
Contact Electromigration 0.9 eV 
Electrolytic Corrosion 0.54 eV 





Six different data patterns were programmed into the 64K PROMs 
for HTOL testing: a diagonal of zeros (98% programmed); a 
diagonal of ones (2% programmed); a topological checkerboard 
pattern (50%); all zeros (100%); all ones (0%); and an incrementing 
pattern (50%). During burn-in, all addresses are sequenced 
through ata 1 MHz clock rate. The outputs are enabled and loaded 
with a 100 ohm resistor to a 2 V supply. This results in an output 


loading of equal to or greater than the data sheet specified limits of 
Тон = -4 mA and Ip, = 16 mA. In most cases, the PROMs were 
burned-in at Vcc = 5.5 V, 125*C. However, voltage acceleration 
experiments were also done at 7 V, 125°C as well as at 8 V, 25°C. 


The PROM is useful for antifuse reliability studies for several 
reasons. First of all, we can program anywhere from 046 to 10046 of 
the antifuses although we program only 2% to 3% of the antifuses 
for a given design on the ACT 1010/1020 device. Also, an antifuse 
failure on the PROM is very noticeable, since the antifuse is 
directly addressed. A weak antifuse would show an AC speed drift, 
and a failed antifuse would read the wrong data. 


To evaluate the ACT 1003/1010/1020 devices, we programmed an 
actual design application into each device and performed a 
dynamic burn-in by toggling the clock pins at a 1 MHz rate. The 
designs selected utilized 85% to 97% of the available logic modules 
and 85% to 94% of the I/Os. Outputs were loaded with 1.2 ka 
resistors to Vpp resulting in greater than 4 mA of sink current as 
each I/O toggled low. Under these conditions, each ACT 1010 
typically draws about 100 mA during dynamic burn-in. Most of this 
current comes from the output loading while about 5 mA is from 
the device supply current. The thermal resistance (junction to 
ambient) of the 68- and 84-pin PLCC packages is about 45°C/Watt; 
for the JLCC packages it is about 35? C/Watt. For a 125°C burn-in, 
this results in junction temperatures of about 150*C for plastic 
packages and 145°C for ceramic packages. Most burn-in was done 
at 5.75 V (for voltage acceleration of the antifuse) and 125°C. Some 
data was taken at 5.5 V, 125°C and at 5.5 V, 150°C. 


Asummary of the HTOL data collected by Actel is shown in Table 
5. A failure is defined as any device which shows a functional 
failure, exceeds data sheet DC limits, or exhibits any AC speed 
drift. Among the parts tested, no speed drift, faster or slower, was 
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observed within the accuracy of the test set-up. Failure rates at 
55*C and 70°C were extrapolated by using the Arrhenius equation 
and a general activation energy of 0.6 eV. Poisson statistics were 
used to derive a calculated failure rate with a 6076 confidence level. 
Use of Poisson statistics is valid for a failure rate which is low and a 
failure mode which occurs randomly with time. At 55°C, the 
calculated failure rate with a 60% confidence level was found to be 
33 FITS (0.003396 failures per 1000 hours) This number was 
derived from over 2.2 million device hours of data. 


Table 5. HTOL (High Temperature Operating Life) Test 


Device Hours 


Device Units Rune at 125°C 
PROM64 275 4 568,000 
1003 JLCC 238 3 359,400 
1010 JLCC 144 6 283,000 
1020 JLCC 61 2 90,000 
1020A JLCC 69 2 69,000 
1010 PLCC 496 6 844,000 
1020 PLCC 32 2 48,000 
Totals: 1315 20 2,261,400 
Summary: 

Observed FITS at 55°C: 21 
Observed FITS at 70°C: 53 
Calculated FITS to 60% confidence at 55°C: 33 
Calculated FITS to 60% confidence at 70°C: 83 


(Summary of Data as of February 20, 1990) 


Unblased Steam Pressure Pot Test 


This test is used to qualify products in plastic packages. Units are 
placed in an autoclave (pressure pot) and exposed to a saturated 
steam atmosphere at 121°C and 15 psi. Problems with bonding, 
molding compounds, or wafer passivation can cause metal 
corrosion to occur in this atmosphere. Metal corrosion is detected 
during a full electrical test of the device following exposure to the 
autoclave environment. 


Table 6. Unblased Steam Pressure Pot Test 


Both of the observed failures were normal CMOS failures and were 
not caused by the antifuses. The 1003 device failed after 80 hours at 
150°C. The unit was functional but had high Ipp current (40 mA vs. 
4mA typical). Liquid crystal analysis revealed a hot spot outside the 
antifuse area of the chip. The other failed unit was nonfunctional, 
with a high Ipp current of about 40 mA. This unit failed after 500 
hours at 125°C. Both failures are believed to be the results of 
junction degradation or silicon defects. 


Equivalent Equivalent 
Device Houre Device Hours 
Fallures at 55°C (0.6 eV) at 70°C (0.6 eV) 
0 23.8 Million 9.4 Million 
1 15.0 59 
0 11.8 47 
0 3.8 1.5 
0 29 1.1 
1 35.3 14.0 
0 2.0 0.8 
2 94.6 Million 37.4 Million 


A total of 426 units from five wafer runs and six assembly lots were 
used. Read points were taken at 96, 168, and 336 hours. There were 
a total of four failures (Table 6). All four failures were caused by 
bond wires lifting off bond pads. This was an assembly problem 
that occurred only on our first lot of plastic units. The failures were 
caused by high temperature, not by metal corrosion. The assembly 
problem was corrected, with no further failures observed. 





121°C, 15 psi 
Number of Fallures 
Product Run Number Package Number of Units 96 Hours 168 Hours 336 Hours 
1010 JB13 84 PLCC 34 0 3 
1010 JB13 68 PLCC 71 1 0 0 
1010 JB14 68 PLCC 71 0 0 0 
1010 JB22 68 PLCC 71 0 0 0 
1010 JB27 68 PLCC 50 0 0 0 
1010A TI24 68 PLCC 129 0 0 0 








Blased Molsture Life Test (85/85) 


In this test, the units are placed in a chamber at a temperature of 
85°C and a relative humidity of 85%. A voltage of 5.5 V is applied 
to every other device pin while other pins are grounded. 5.5 V is 
applied to Vpp while Vss is grounded. This test is effective at 
detecting die related and plastic package related problems. 


Table 7. Biased Molsture Life Test 
85°C/85% Humidity with DC Alternate Pin Bias of 0 V to 5.5 V 


ACT 1010/1020 Rellability Report 


Asshown in Table 7, a total of 288 units were stressed. There were 
three failures. One failure was caused by two lifted bond wires; it 
was from the same lot in which we saw failures in steam pressure 
pot test. The second unit was functional but had high Ipp current. 
The 1000-hour failure was nonfunctional. 





Run Number Package 








1010 JB13 68 PLCC 
1010 JB14 68 PLCC 
1010 JB22 68 PLCC 
1010 JB26 68 PLCC 
1010 JB27 68 PLCC 


Temperature Cycling 
This test checks for package integrity by cycling units through 
temperature extremes. For ceramic packages, the range of 


Table 8. Temperature Cycling Test 
-65°C to 150°C Ceramic; 0°C to 125°C Plastic 


Number of Units 









Number of Fallures 


500 Hours 2000 Hours 





1000 Hours 





оо о о 
оороо = 
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temperature is -65°C to 150°C. For plastic packages, the range is 
0°C to 125°C. Both programmed and unprogrammed units are 
placed on temperature cycle. Data on 451 units is summarized 
in Table 8. 








Fallures 
Product Run Number Package Number of Units 100 Cycles 200 Cycles 1000 Cycles 

1010 JB13 68 PLCC 158 0 _ 0 

1010 JB14 68 PLCC 28 0 _ 0 

1010 JB26 68 PLCC 21 0 - 0 

1010 JB28 68 PLCC 31 0 = 0 

1020 JB22 84 PLCC 17 0 _ 0 

1010 1077 84 JLCC 20 0 0 

1020 JB33 84 PGA 25 0 

1010A TI24 68 PLCC 176 0 = 0 

Other Tests grounded, and all other I/Os grounded. Thus each pin received a 


Electrostatic Discharge (ESD) 


Units were tested for sensitivity to static electricity by using the 
human body model as described in MIL-883C (100 pF discharged 
through 1.5 kQ). Fifteen ACT 1010 units from three wafer runs 
were tested. Nine representative I/O pins were checked on each 
device. Since all I/O pins have the same layout on the chip, the nine 
pins tested were selected based on their proximity to Vss, Vpp, or 
the corner of the chip. The MODE pin was also tested because it is 
the only dedicated input on the chip. In addition, the three power 
supplies (Vss, Мор, Vpp) were tested. Three positive and three 
negative pulses were discharged into each pin tested at each voltage 
level. For inputs and I/Os, these six pulses were applied with three 
different grounding conditions; Vss only grounded, Vpp only 


total of eighteen pulses for each test voltage. Testing began at 1000 
V and continued in 500 V increments. After pulsing was completed 
at each voltage, the I-V characteristic of each pin was checked ona 
digital curve tracer. Any significant change in the I-V curve from 
the previous reading was considered a failure. The units were then 
tested on a VLSI tester. Leakage currents were datalogged at 0 V 
and 5.5 V. Any pin showing more than 250 nA of leakage current 
was also considered to be a failure. For Ipp, a change of more than 
250 uA was cause for rejection. No failures occurred through 2000 
V. At 2500 V, five of the fifteen units failed on at least one pin. 
Failure analysis revealed that the failures occurred on the 
N-channel pulldown transistor of the output driver. With no 
failures through 2000 V testing, the ACT 1010 (and the ACT 1020, 
by virtue of identical I/O layout to the 1010 device) met the 
requirements for the 2000 V ESD category of MIL-883C. 
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Latch-up 


Latch-up is a well-known cause of failure in CMOS circuits. 
Parasitic bipolar transistors are created by the P-channel transistor, 
the N-channel transistors, the N-well, and the P-substrate. These 
transistors are connected in a manner which effectively creates an 
SCR. If a voltage on an external pin were to forward bias to the 
substrate, the parasitic SCR can be latched to the on state, creating 
a low-impedance path between V pp and ground. A large amount of 
current then flows through this path. This current can, at best, 
make the device temporarily nonfunctional and, at worst, cause 
permanent damage. 


Several techniques are used by CMOS designers to reduce the 
chance of latch-up. One of the most common techniques is the use 
of guard rings to isolate P-channel and N-channel transistors. The 
disadvantage of this method is that it requires additional silicon die 
area. Another method is to use a substrate bias generator. Creating 
a negative substrate bias means that an input must go even more 
negative to cause latch-up. A third technique is to use EPI wafers to 
achieve low substrate resistance, which lowers the chances of 
triggering latch-up. Actel designers use both guard ring and EPI 
wafer techniques for ACT 1010/1020 devices. 


The latch-up test method used is defined by JEDEC Standard No. 
17. Each YO pin on a tested device was forward biased in both 
directions (to Vss and Vpp) by forcing negative and positive 


currents ranging from +50 mA to +250 mA in 50 mA increments. 
Following each stress, the device Ipp current was measured. If the 
current exceeded the data sheet limit of 10 mA, the unit would be 
rejected. The device was also functionally tested. 


Fifteen units from three different wafer lots were tested. Testing 
was done both at room temperature and at a worst case 
temperature of 135°C. All device I/Os and power supplies were 
tested. No failures were detected through 250 mA. 


Conclusion 


The data presented in this report establishes the excellent 
reliability of Actel ACT 1010/1020 devices. Both the Actel models 
and the test devices show that the antifuse is highly reliable and that 
it detracts negligibly from overall product reliability. 
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Determining which array to use is an important part of design 
implementation. This applications note shows how to easily select 
the correct Actel АСТ?М 1 array for your design. 


The ACT 1 family of gate arrays contains the ACT 1010, a 1200- 
gate array, and the ACT 1020, a 2000-gate array. 


The ACT 1010 contains 295 logic modules and 57 I/O modules. 
The ACT 1020 contains 547 logic modules and 69 I/O modules. 
Logic modules can implement a variety of logic functions, such as 
gates, flip-flops, and latches. An I/O module can be designated as 
an input, output, three-state, or bidirectional buffer. 


Each array contains a master clock buffer. This buffer can drive 
flip-flops or latches without fan-out restriction. 


To determine the correct device for your design, first consider I/O 
requirements — including simultaneously switching outputs (SSOs) 
—then consider logic requirements. 


In determining I/O requirements, consider the package 
limitations, as shown in the table below. Power and ground are not 
included in the limits given; separate bond pads and package pins 
are provided for power and ground. SSOs are outputs that switch 
within 10 ns of each other. 





I/O Limits 
ACT 1010 ACT 1020 
Package Ose SSOs Os SSOs 
44-pin chip carrier 34 16 34 16 
68-pin chip carrier 57 24 57 24 
84-pin chip carrier N/A N/A 69 32 
84-pin grid array 57 32 69 32 





To calculate the number of logic modules required by a design, add 
up the individual logic module requirements of each macro used. 
The following table gives typical logic module requirements for 
various hard macro functions. Consult the Actel Macro Library for 
a complete listing of macro types and actual logic module 
requirements. For calculation purposes, convert complex functions 
(e.g, 74163) into gate and flip-flop schematic equivalents. Refer to 
the manufacturer’s databook for this information. 


Which Actel array should 


Applications 
Note 


Note that both true and inverting inputs are provided on all 2-, 3-, 
and 4-input functions, so inverters are rarely needed in Actel 
designs. 


Design limits for Actel devices are 295 logic modules for the ACT 
1010, and 547 logic modules for the ACT 1020. But device 
utilization, the percentage of available modules used, must be 
considered also. Designs using 85% or less of available logic 
modules can be placed and routed easily. Device utilization above 
85% may lead to problems, especially when designing for fast AC 
performance. 


Logic Module Requirements 














Macro Function Logic Modules 
2-input gate 1 
3-input gate 1 
4-input gate 2 
XOR gate 1 
XOR OR gate 1 
XOR AND gate 1 
AND XOR gate 1 
AND OR gate 1 
OR AND gate 1 
Buffer/Inverter 1 
2- or 4-input Muitiplexor 1 
Adder 2 
D-latch 1 
D-latch with clear 1 
D-latch with enable 1 





Multiplexed latch 1 
D flip-flops 2 
Muttiplexed D flip-flops 2 





D flip-flop with enable 
JK flip-flop 


N № 
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Efficient Logic Conversions 





Introduction 


In order to obtain the most efficient conversion of a logic design 
into Actel's format, the designer should become familiar with the 
available Actel macros. The Actel Macro Library manual lists 
available Action Logic™ System macros, with AC performance, 
fan-in, and levels of logic given. 


Most common logic functions are readily implemented using Actel 
macros. This applications note describes several common 
functions created from Actel macros, and serves as a designer’s 
guide to efficient logic conversions. New macros are frequently 


Applications 
Note 


added to the Actel Macro Library. Always check the current 
software library shipped with the Action Logic System before 
creating a macro. 


All Actel macros are created with logic modules. Each logic module 
is an eight-input, one-output circuit, which consists of three 2-to-1 
multiplexors and one 2-input OR gate. See Figure 1. 


This circuit can implement a wide range of functions, and it 
interconnects efficiently through routing resources as required by 
the circuit application. 


Figure 1. Logic Module Schematic 





Actel macros fall into two categories, hard macros and soft macros. 
Hard macros consist of one or two logic modules. Soft macros were 
created for logic functions requiring more than two logic modules. 
Hard macros have fixed logic module positions and wiring 


interconnections to maintain specified AC characteristics. Soft 
macro logic modules are placed on the array as needed, and the 
wiring path is routed to complete the interconnection. Soft macro 
AC performance is determined after place and route. 





© 1891 Actel Corporation 
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Combinatorial Functions Existing In The Macro effective” function. Many macros have multiplexor functions built 
Library in; Figure 2 illustrates how several logic functions are combined, 


saving space and power. 
Since the basic macro building block (the logic module) is created 
from multiplexors, the multiplexor can be viewed as the most “cost 
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Equivalent schematic of a DFM macro 
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Figure 2. Sample Logic Function Equivalents 








Note from Figure 2 that the DFM multiplexed D-input flip-flop modules, while the more efficient version uses only two. This is a 
makes use of its inputs to achieve other functions by simply very powerful technique, and can be extended to other functions in 
connecting Vcc and GND as shown. The original circuitry the library. 


consisting of a gate and flip-flop would have used three logic 


Efficient Logic Conversions 


Actel provides every combination of 2-, 3- and 4-input functions needed. Figure 3 illustrates the logic savings realized using Actel 
with true and inverting inputs, so inverters are almost never macros in a standard circuit. 





Standard Circuit 
Actel Circuit 





NAND3B ]( 


Figure 3. Sample Logic Conversion 





The standard circuit requires five logic elements. The Actel circuit 
requires only two logic modules; it runs faster and uses less power 
than the standard circuit. 
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Counting Logic Modules 


When converting a logic design into an Actel array, do not count 
the number of gates in the original design; count logic module 
requirements. Many standard logic functions can be implemented 
with fewer Actel logic modules than expected, as shown in the 
examples in this application note. Table 1 gives typical logie module 
requirements for various hard macro functions. Many complex 
functions, such as TTL replacements, exist as macros in the Actel 


Table 1. Logic Module Requirements 


library. Consult the Actel Macro Library for a complete listing of 
macro types and actual logic module requirements. 


Both logic module and I/O module requirements must be 
considered in selecting an array for a given design. Refer to the 
ACT 1datasheet for array specifications. For more information on 
choosing an array for your design, refer to the applications note 
“Which Actel array should you use?” 








Macro Function Logic Modules Notes 
2-Input gate 1 Includes AND, NAND, OR and NOR gates. 
3-input gate 1 NAND3 and ORSC require two logic modules. 
4-input gate 2 See Note 1. 

XOR gate 1 

XOR OR gate 1 XOR/XNOR followed by an OR gate. 

XOR AND gate 1 XOR/XNOR followed by an AND gate. 
AND XOR gate 1 AND gate followed by an XOR gate. 

AND OR gate 1 12 types available. 

OR AND gate 9 types available. 

Buffer/Inverter 1 Least efficient of all macro types. 

2- or 4-input Multiplexor 1 Most efficient macro. 6 types avallable. 
Adders 2 7 types available 

D-latch 1 

D-latch with clear 1 

O-latch with enable 1 

Muttiplexed latch 1 2 input Multiplexed D-latch. 

D flip-flop 2 22 types available. Single output only. 
Multiplexed D flip-flop 2 

D flip-flop with enable 2 Allows multiple clocked systerns. 





Note: 


1. The designer could use an average of 1.5 logic modules for the 
four-input gate conversions, as 50% of these gates use only one logic 
module. 





Efficient Logic Conversions 


Implementing Logic Functions Using The designer can combine several functions using one MXT 
Multiplexors macro, as shown in Figure 4b. Note the savings of one level of logic, 


which saves power and improves speed. 


Many functions can be implemented using the simple two-input The function shown in Figure 4b exists as a macro (AO1C) in the 
multiplexor. Figure 4a illustrates four common functions and their Actel Library. 
multiplexor equivalents. 





Figure 4a. Multiplexor Functional Equivalente 


Y Logic Function 


Actel Circuit 





Figure 4b. Implementing Logic with the MXT Macro 





Creating Toggle Flip-Flops With Q Outputs 


Because Actel’s flip-flops have only one output, О or ON, creating 
a toggle flip-flop with a Q output usually involves the addition of an 
inverter (see Figure 5). Figure 5 also illustrates a way to eliminate 





the inverter and save one logic module. Note that the loading of the 
Q output remains the same for both circuits. The advantages are in 
eliminating one logic module, improving speed, and lowering 
overall power dissipation. 


With 


Q Output 


Without 
Inverter 


Q Output 


Figure 5. Creating Toggle Flip-Flops with Q Outpute 





The inverter circuit can also be replaced by using the JKF JK 
flip-flop; however, the DFM series offers more possibilities. 





The DFM flip-flop contains a multiplexor, at no additionat cost, 
and a D flip-flop, and by connecting the inputs as shown, the need 
for the inverter is eliminated. 


Efficient Logic Conversions 


Avoiding Gated Clock Inputs Note that the use of four-input multiplexors is especially 
dangerous. If both select lines change simultaneously, the output 
Actel does not recommend the use of gated clock inputs. Use D almost always glitches, and false clocking of the flip-flop can occur. 


flip-flops with enable to emulate a gated clock configuration. 
Figure 6 illustrates the use of D flip-flops with enable. 








Gated clock example 


Multiplexed clock example 





System clock No gated clock inputs 





No gated clock Inputs 


Figure 6. Avoiding Gated Clock Inpute 


Important: The techniques described in this applications note implementations, or “mappings,” that provide flexibility to the 
make use of logical tie-offs to Vpop and GND. Excessive use of place and route software. Tie-offs restrict the number of allowable 
tie-offs should be avoided. Macro cells frequently have multiple mappings. 
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The Actel Timer 





Introduction 


The Actel Timer is an interactive, menu-driven tool for analyzing 
design timing. It automatically provides delay data for all paths 
between clocked macros, and for any other paths specified by the 
designer. 


The designer may review the Timer output as total path delay or 
break up the path to show the delay through each macro. 


Setup 


When first entering the Timer, the designer selects operating 
conditions for the device: typical, commercial, industrial, or 
military. The worst-case temperature and voltage specifications of 
each condition are shown in Table 1. The designer may also choose 
to inspect either post-layout or pre-layout delay data. Pre-layout 
delays are estimates based on statistical propagation delays and 
loading. Post-layout delays include placement and routing data 
specific to the design, and thus is the most precise timing 
information. 


Table 1. Worst-Case Operating Conditions 


Condition Temperature (°C) Voltage (V) 
Typical 25 5.0 
Commercial +70 4.75 
Industrial +85 4.5 
Military +125 45 








Applications 
Note 


Design Analysis 


The Timer automatically extracts all delays between the pins of 
clocked macros and places the delay information in a default pin 
set. This information is useful for examining delays between pins, 
and it provides a quick reference for the design’s maximum clock 
frequency. In addition, the information may be manipulated easily 
to reveal clock line skew or any other signal skew in the design. 


Pin sets can be created and edited. These pin sets can be saved and 
recalled; pins may be added to or removed from any pin set. Timer 
commands are available to logically “AND” or “OR” groups ofpins 
to create new groups of pins. 


The Timer displays delays occurring between module input pins. 
For example, “delay1” in Figure 1 below includes the delay of gate 
“GO” plus the net delay of "Net0." 


The Timer provides a variety of options for design inspection and 
analysis. The “longest” and “shortest” commands allow you to view 
all delays between pin sets. “longest” shows the ten longest delays 
in descending order. “shortest” shows the next ten slowest paths, 
incrementally, for the entire set. The “longest” command display 
appears in Figure 2. The information includes the delay rank, the 
delay in nanoseconds, and the path end points. 








Figure 1. Input-to-Input Delay Measurement 
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1st longest path to all endpin 


The “path” command shows the delay along a specified path in 
more detail. As shown in Figure 3, “path” shows how each module 
contributes to the overall delay of the path. The information 
displayed includes the cumulative delay, the delay through the 


Rank 
0 
1 


о о 


Total 
84.2 
83.4 


77.0 
76.1 


Startpin First Net End Net End Pin 
F4:CLK NET1 NET2 F1:J 
F17:CLK NET3 NET4 F8:J 

° 

° 

. 
F5:CLK NET30 NET21 F0/U0:K 
F6:CLK NET32 NET32 F0/U0:J 


Figure 2. “longest” Command Display 








macro, delay type, number of loads, macro name, and net name. improve performance. 


ist longest path to U0/U7/F0/U0:K (falling) 


Total 
82.1 
75.6 
68.8 
56.8 
46.0 
36.0 
19.6 
12.5 
0.0 
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Delay 
6.5 





Load Macro Starting pin 
0 JKF 0 G12:B 
1 NAND2 G3:A 
2 XOR Gt:A 
1 NOR2A G14:A 
1 NOR4A G13:C 
7 AX1 G5:C 
1 AND2A G4:A 
6 JKF 1 U1:CLK 
16 UO:CLK 


Figure 3. "path" Command Display 


Using the "path" command, the designer can pinpoint delays in a 
design, then fine-tune design timing. For example, the designer 
may pinpoint where a heavily loaded macro is creating excessive 
delay, then use parallel logic in the design to reduce the load and 


Net name 
NET א‎ 
NET Y 


m 
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Using Actionprobe™ 


Diagnostic Tools 





Applications 
Note 





Introduction 

Actel's probe pin circuitry permits external monitoring of ACT™ 1 
device internal signals after device programming. This unique 
diagnostic feature allows 100 percent observability of a device. 
Observability reduces the time required for design verification and 
test vector generation; it also facilitates system troubleshooting. 
One hundred percent observability of all internal device signals is 
unique to Actel field programmable gate arrays; this feature is not 
available in conventional masked gate arrays or programmable 
logic devices. 


Two dedicated probe pins, PRA and PRB, provide this 
observability on ACT 1 family devices. Actel’s Debugger software 
and Actionprobe™ diagnostics hardware permit the connection of 
any two signal nodes on the device to the probe pins. Signal node 
assignments may be changed freely, under software control. 


Setup 


Actionprobe hardware consists of a tower probe with a footprint of 
the selected package. A socket on top of the tower probe holds the 
programmed ACT device in place. The Actionprobe unit, with a 
programmed ACT device, can be plugged directly onto a system 
board. The device can be verified and debugged as it receives 
real-time stimuli from the system. See Figure 1. 


The Actel Activator™ bus board drives the control signals SDI, 
DCLK, MODE, and GND via a flat ribbon cable. The MODE pin 
determines whether the device is in debug mode. SDI receives the 
serial addresses of the internal nodes from the Activator bus board. 
DCLK clocks the serial address into the device. When the device is 
being debugged in-circuit, SDI, DCLK, and MODE are 
terminated to ground through a > 10 КО resistor. Probe pins may 
be connected directly to a logic analyzer or oscilloscope. 















































































































































































































































































































































































































































































































































































































































































































































































































































































































































Figure 1. In-Circuit Debug Setup 
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In-Circuit Probing 


Changing the signal nodes is done simply by changing node names 
with the Debugger software. The newly assigned signals are 
connected automatically to the probe pins. Internal signals up to 10 
MHz can be monitored externally. The internal signal passes 
through an inverting buffer before reaching the probe pin. 


The “ICP” (In-Circuit Probing) command connects the probe pins 
to internal nodes. The syntax is : 


ICP node 1 node 2 


where "node 1" (node name) is connected electrically to PRA, and 
*node 2" is connected electrically to PRB. 


Probe Calibration 


The probe circuitry does not introduce any additional loading to 
the design, so the AC characteristic of the observed internal nodes 
remains unchanged. And, because probe propagation delay is 
independent of layout, probe delay remains unchanged for all 
points in the device. 


The skew of the probe pins can be measured, then used to calibrate 
for accurate measurement of propagation delay. When both probe 
pins are assigned to the same point on the device, the delay 








tsk = tpra - lena 


Figure 2a. Measuring Skew of Probe Pins 
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difference measured is the skew of the probe pins. This skew is 
subtracted from subsequent delay measurements in the circuit. In 
Figure 2a, for example, both PRA and PRB are connected 
electrically to node Net0. The delay difference is the skew, 
calculated as tsk = tpra - ters. Using the Debugger software, the 
slower probe (PRA) is assigned to node Net3. Figure 2b shows this 
configuration. In this example, actual propagation delay is the 
measured delay time between the output of G0 and the output of 
G3, minus the probe skew time. Actual delay is calculated as: tpp = 
tpra — tere ~ tsk 


Note: Due to the propagation delay difference between rising and 
falling signals, both probe pin signals must be either rising or 
falling when they are used to calibrate for delays 
measurement. 


Pin Assignment for Dedicated I/O Pins 


During device verification, dedicated pins SDI, DCLK, PRA, and 
PRB are assigned as special I/O pin. These pins should be assigned 
as non-critical, so the design is functional without them. After 
device verification, these pins can be assigned as regular I/Os by 
programming the security fuses. This disables the probe pins to 
prevent unauthorized device probing. 





PRB 


teo = tena - lens - tex 


Figure 2b. Callbrating for Accurate Propagation Delay 
Measurement 


Metastability 





Actel Metastability Characteristics 


Designers often have asynchronous signals coming into 
synchronous systems. Typically a flip-flop is used to synchronize the 
incoming signal with the system clock. 


If the asynchronous incoming signal does not meet the setup time 
requirement for the flip-flop, there exists a window of time where 
the incoming signal may cause the flip-flop to develop an unknown, 
or metastable, logic condition. Figure 1 shows this window as tw. 
Actual clock setup time of the flip-flop is shown as tsu; propagation 
delay of the flip-flop is shown as tco. Resolution time (tres) between 
flip-flop output and the next clocked device is the amount of time 
required for the metastable condition to stabilize. 


The duration of a metastable condition is probabilistic. But the 
designer can calculate how often a metastable state will last longer 
than a given duration. Mean time between failures (MTBF) can be 
calculated from the following equation: 





DATA IN 
CLK 
tw 
DATA או‎ 
tsu 
CLK 
со 

Q1 


м 


Applications 
Note 


1 


MTBF = 
"Cle -C2*tres 


fax " fast 
where the constants depend on the ACI™ 1 device characteristics, 
and: 
MTBF - Mean time between failures (s) 
fu, = System clock frequency (Hz) 
fast = Incoming data rate (Hz) 
e — Natural log base 
tres = Resolution time (ns) 
C1 = 10°/Hz 
C2 = 4.6052/ns 
(Value of C2 derived from circuit simulations.) 


Q1 


Figure 1. Metastable Condition 
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Jefe 








Sample Calculation 


Using the MTBF equation for a design with a system clock 
frequency of 10 MHz and a data rate of 1 MHz, various resolution 
times produce the results shown in Figure 2. Linear increases in 
resolution time cause exponential increases in MTBE 
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MTBF 
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Figure 2. Metastable MTBF ae a Function of Resolution Time 


Using the Actel Debugger 


as a Functional Tester 





Introduction 


Actel's Activator™ programming and diagnostics unit, together 
with Actel’s Debugger software, provide powerful tools to 
functionally test an ACT™ device. Device debugging begins after 
design configuration and device programming. Debugging is 
performed with the device inserted in the Activator unit. The user 
accesses Debugger software from the Action Logic™ System 
(ALS) main menu. 


Debugger functional test allows the user to observe any internal 
node of the device. The user defines the device inputs with 
Debugger menu commands or with a command file, or any 


UD AN 


X 
V 


2А М 5 U1 
IT > 
P2 X B= Оз 
рз X p U4 
P4 X [з = 
P5 X Be U6 
Ps X 
₪ МІ > us 
CLK >] p Uo 
CEP EX | 5 u10 
CET М Ë U11 
PE X > u12 


Applications 
Note 


combination of the two. Command files and test vector files are 
created with an ASCII text editor, then loaded into the Debugger. 
User-defined macros may be created in a command file, then 
executed in the Debugger. 


This applications note shows Debugger commands for a sample 
design. The sample design is Actel’s TT269 (TTL 74269), an 
eight-bit binary loadable up/down counter with count enable. 
Figure 1 shows the sample design; Table 1 shows the truth table for 
the part. PO through P7 are parallel load inputs; Q0 through Q7 are 
counter outputs; CLK is the counter clock; UD is the up/down 
count selector. Internal nodes (nets) should be labeled during 
design capture for easy reference during debugging. 








90 »- x QOUTO 
a »- > оойт 
92 > 54 QOUT2 
93 > > QOUT3 
Q4 > >. QOUT4 
95 > > QOUTS 
95 > > QOUT6 
9r > > оол? 
us > >] TCOUT 


Figure 1. TT269 
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Table 1. TT269 Truth Table Assigning Test Vectors 


The default input radix for all test vectors is decimal. To specify a 
binary, hex, or octal radix, add a 0b, 0h, or 0o prefix, respectively, to 
the vector (e.g., 001010 or 0h7e). Outputs are in binary format. To 








0 x x x 0 1 0 1 interactively define input test vectors, use the Debugger menu. 
0 x x x FF t FF 0 Alternatively, use input command files to define test vectors. To 
1 x 1 x x t Hold view outputs and internal nodes, print them to the PC screen 
1 i x א‎ X f Hold display or to an output file. 

1 0 0 1 x T Increment 

1 0 0 0 х T Decrement 


Action Logic System 
L INPUT1 INPUT2 INPUT3 INPUT4 INPUT8 CLOCKIN 
vector H INPUT5 INPUT6 INPUT7 
Assign TABADD INPUT2 INPUT5 CLOCKIN ברד‎ Set up list 
iw TABADD OUTPUT1 OUTPUT2 OUTPUT3 of signals 
Sep to observe 
High 
nos H CLOCKIN 
i 
5 STEP Apply stimulus; 
Assign allow time for 
L CLOCKIN INPUTS 
Print response 
H INPUT2 


STEP 
Step H CLOCKIN Revise input stimulus 


Satup STEP 
view PRINT 


FPrint 


INPUT2=0 Print values of signals 


In tablist t 1 
INPUT5-1 blist to pc screen 


CLOCKIN=0 
OUTPUT1-1 
OUTPUT2-1 
OUTPUT3-0 


Design: 
sample 





Figure 2. Typical Debug Command Sequence 
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Defining User Macros 


You may save time by creating user-defined macros for the 
Debugger. These macros may contain a series of basic Debug 
commands or may nest any combination of basic commands and 
user-defined macros. 


define (clk10) (repeat 10 (1 CLK) (step) (h CLK) (step) 


Using the Actel Debugger as a Functional Tester 


The sample macro in part A of Figure 3, c 1k10, provides 10 clock 
pulses to the pin CLK and prints the value of internal vector Q to a 
specified output file after each clock pulse. The OUTFILE 
command specifies the output file. 


Part B of Figure 3 shows a nested macro, c1k100, which executes 
the c1k10 macro 10 times, providing 100 clocks to the CLK pin. 


(fprint Q)) 





define (clk100) (repeat 10 )01%10(( 





Figure 3. Sample Command Macros 


Creating a Command File (TT269.cmd) 


The command file (see Figure 4) applies test vectors to the TT269 
counter. It redirects results to an output file, TT269. out and 
compares the output vector Q of the counter to an existing results 
file, TT269 . cmp. The following notes correspond to each line in 
the command file. 


Lines 1 and 2: The vector command defines eight parallel load 
input bits as vector P, and counter output as vector Q. 


Line 3: The tabadd command defines the internal or external 
nodes to be displayed or printed when the print or fprint 
command is executed. 


Lines 4, 5, and 6: The infile, outfile, and compfile 
commands define input and output files. infile opens a file 
containing input test vectors. outfile contains the output 
results. compfile contains the data to be compared against the 
current device status. Use the full path name of the file, and enclose 
it in question marks. 


Line 7: The define commands create user-defined macros. In 
this example, the c1k10 macro provides 10 clock pulses to the 
CLK input, fprint prints all nodes in the tabadd command toa 
file defined by out file. fcomp compares the status of vector Q 
to the file specified by сотр? ile. 


Lines 8, 9, and 10: Defines three user macros: up, down, and 
load. 


Loading a Command File 


The loadfile command loads a defined command file 
into the Debugger. Select setup | loadfile from the 
Debugger menu, then enter the full path name of the command 
file. For example: 


/designs/TT269/TT269.cmd 


Executing User-Defined Macros 


To execute any previously defined macro, type the macro at the 
command line while in the Debugger. 


(vector P PO P1 P2 P3 P4 P5 P6 P7) 
(vector Q QO Q1 Q2 Q3 Q4 Q5 Q6 Q7) 
(tabadd PE CEP CET UD P CLK Q TC) 
(infile "/designs/tt269/tt269.pat") 
(outfile "/designs/tt269/tt269.out") 
(compfile "/designs/tt269/tt269.cmp") 


(define (clk10) (repeat 10 (1 CLK)(step)(h CLK) (step) (fprint)(fcomp Q))) 


(define (up) (1 CEP CET) (h PE UD) (step) (clk10)) 


(define (down) (h РЕ) (1 CEP CET UD) (step) (c1k10)) 


(define (load) (1 PE CLK)(step)(fassign P)(h CLK) (step)) 





Figure 4. Debug Command File (TT269.cnmd) 











Running the Sample Command File 


In the following example, we will assign input test vectors from an 
input pattern file named TT269. pat, and write output results 
to a file named TT269 . out. The command file (TT269 . cmd) 
compares the counter's output vector Q to an existing results file, 


TT269.cmp. 


Output Results File (TT269. out) 
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C 
E 
P 


OoOOO0O00O00O00O00000000000000000000000000000000 
оооооооооооооооооооооооооооооооооооооооо 


ооооооооооннннннн H H H 0 0 0 0 O O O O O O H H H H H H H H H k 


P 


00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
00000000 
10000000 
10000000 
10000000 
10000000 
10000000 
10000000 
10000000 
10000000 
10000000 
10000000 
01000000 
01000000 
01000000 
01000000 
01000000 
01000000 
01000000 
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01000000 
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11000000 
11000000 
11000000 
11000000 
11000000 
11000000 
11000000 
11000000 
11000000 
11000000 


0 
L 
K 


кҥн HH HH HB HB H H H B tH H i H ы H H H ыыы B ы ы ы H ы ы H ы H i BH i B HH ыз 


10000000 
01000000 
11000000 
00100000 
10100000 
01100000 
11100000 
00010000 
10010000 
01010000 
00000000 
11111111 
01111111 
10111111 
00111111 
11011111 
01011111 
10011111 
00011111 
11101111 
11000000 
00100000 
10100000 
01100000 
11100000 
00010000 
10010000 
01010000 
11010000 
00110000 
01000000 
10000000 
00000000 
11111111 
01111111 
10111111 
00111111 
11011111 
01011111 
10011111 


ооооооонооооооооооооооооооооонооорооооооо 


Input Pattern File (TT269.pat) 


0ъ00000000 
0510000000 
0501000000 
0511000000 


Output Compare File (TT269.cmp) 


0b10000000 
0b01000000 
0b11000000 
0b00100000 
0b10100000 
0b01100000 
0b11100000 
0b00010000 
0b10010000 
0b01010000 
Ob00000000 
0b11111111 
0b01111111 
0b10111111 
0b00111111 
0b11011111 
0601011111 
010011111 
0600011111 
0611101111 
011000000 
000100000 
0610100000 
0601100000 
0611100000 
0600010000 
0510010000 
0001010000 
011010000 
000110000 
0501000000 
0510000000 
000000000 
011111111 
0601111111 
010111111 
000111111 
0511011111 
0001011111 
0510011111 





Three-Stating 


ACT" 1010/1020 Designs 





During board test and debug it is frequently desirable to place all 
chip I/Os into a three-stated condition. This provides isolation 
from other three-stating circuit devices connected to signals 
common to the ACT™ device. The three-stated condition also 
allows board test for trace integrity or insertion damage to ACT 
device pins. 


Three-stating a design is easy using the unique debug features of 
ACT device architecture. Three special pins on the ACT device 
facilitate three-stating: MODE, SDI, and DCLK. 


Three-State Pin Assignments 


PLCC/JQCC PGA 

44-pin 68-pin 84-pin 84-pin 
MODE 34 54 66 E11 
SDI 36 56 72 B11 
DCLK 37 57 73 C10 





Applications 
Note 


Pins SDI and DCLK should remain unassigned by the user or 
should be defined as input-only. 


You may three-state all user-defined pins regardless of their 
normal mode definition: input-only,  output-only or 
three-stateable. Each pin can be temporarily three-stated for test 
and debug purposes. 


Figure 1 shows the sequence of three-stating. Seven data bits are 
clocked into the device, using the SDI pin as data input, and DCLK 
as clock. The MODE pin distinguishes “test” mode from “normal.” 
The data sequence clocked is {0001011}. After clocking the 
seventh bit, all user-defined pins enter a three-state condition until 
MODE is taken low. 


Actel Actionprobe™ diagnostic tools allow 100% observability of 
internal nodes; ACT devices may also be isolated from external 
board circuitry. Together, these two features provide a powerful 
debugging feature previously unavailable in custom devices. 








Figure 1. Three-state Timing 


Notes: 

1. 0 V < Vy, < 0.5 V; 3.0 V < Vin < Vee 

2. Test mode configuration is a low frequency ( < 1.0 MHz) operation. 
3. All setup and hold conditions (ty, ts) > 250 ns. 
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Multi-ASIC and 
System Simulation Applications 


with Actel FPGAs Note 





Multiple ASIC and board-level design simulation produces a vsm design_name -w -ddesign_name.dtb 
dilemma: while individual simulation of component ASIC designs 
is possible, there is no easy system-level simulation process for 


designs consisting of ASICs and other component types such as primitive level. The WIR format file includes delay information 
from the file design_name.dtb. 

Actel offers complete timing simulation using Viewlogic® e Copy each WIR format file to the WIR subdirectory. The 
Viewsim® for board-level designs with an unlimited number of viewdraw.ini file specifies the location of the WIR subdirectory 
Actel field programmable gate arrays (FPGAs). (on the line beginning with “DIR”). 
The technique is as follows: e Modify each WIR format file (using non-document mode), 
e Design and simulate timing for each FPGA. This process adding the lines: 

includes Viewsim functional simulation, place and route using G GND 

Actel’s Action Logic"! System (ALS) software, and G VDD 

back-annotated timing simulation. between the last AP/AS line and the first M line in the file, as 
e From a DOS prompt in the \workview directory, execute the shown in Figure 1. 


following command for each FPGA design: 





Figure 1. Modifying the WIR Format File 
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The command causes the Viewsim compiler to create a flat WIR 
format file for the design to the Builtin or Viewlogic simulation 
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sefe 





e Edit the file \workview\viewdraw.ini to comment out the Actel 
library pointer and to add pointers for the other libraries to be 
used in the design. An example is shown in Figure 2 below. 


VIEWDRAW.INI (OLD) 


DIR 0 C: NDESIGNS 
DIR 6 C: NACTVIEW 
DIR 7 C: \WORKVIEW\BUILTIN 


e Create symbols for each Actel FPGA in Workview®. 
e Capture the system-level schematics and simulate the design. 


VIEWDRAW.INI (NEW) 


DIR 0 C: NDESIGNS 


| DIR 6 C:\ACTVIEW 
DIR 7 C: \WORKVIEW\BUILTIN 
DIR 11 CNWORKVIEWN74LS 





Figure 2. Modifying the viewdraw.ini File 
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Using the DRAM 


Controller Supermacro 





Introduction 


Processor-based systems require control logic for DRAM accesses 
and refresh. System designers can use a VLSI DRAM controller 
chip with a built-in processor interface. The chips are easy to use, 
but slow, and they operate in a limited number of configurations. 


One alternative is to build a controller out of discrete logic, 
including PAL®, delay lines, and standard components. Such a 
controller can be customized to the system, but it requires several 
components with the attendant assembly, board space, and failure 
rate costs. : 


Another way to implement the controller is to use a masked gate 
array. Although this provides a single-chip custom solution, large 
NRE costs and lengthy delays are inherent problems in developing 
masked gate arrays. 


The ideal solution is to use an Actel field programmable gate array 
(FPGA.) The Actel library contains a DRAM controller macro 
that may be customized for the target system and then combined 
with other logic required to implement the system. 
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Applications 
Note 


This note explains the operation of the macro and how to use it in 
developing a system. The design has been tested at a clock rate of 
16 MHz and can operate easily with both 120 ns and 100 ns 
DRAMs. 


The sample implementation has two address channel inputs and 
decodes two address bits to four RAS drivers. It has no byte 
decoding so that it addresses four megabytes on a 32-bit boundary. 


Architecture 


The architecture of the controller is shown in Figure 1. The three 
major blocks are the Refresh Timer, Memory Access Arbitrator, 
and Memory Timing Control. 


The refresh timer generates refresh requests on intervals set by 
decoding counter outputs. The arbitrator is a state machine that 
prioritizes memory requests and sends signals to the timing control 
to initialize memory cycles. The timing controller asserts the 
DRAM and address multiplexor control lines at the appropriate 
times for the cycle being executed. A detailed operational 
description follows. 





RW, 
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Figure 1. DRAM Controller Supermacro Architecture 
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Refresh Timer 


The refresh timer consists of two counters, a five-bit binary 
counter, and a three-bit Gray counter. The binary counter 
generates a pulse every two microseconds. Its carry-out is used to 
enable the three-bit Gray counter. The Gray counter outputs are 
decoded to set the refresh request flip-flop and to reset the 
counters. The Gray counter is used to avoid race conditions on 
state decodes, which can cause spurious refresh requests. 


The decode selected from the Gray counter determines the period 
of refresh requests. The macro’s range is from 2 to 14 
microseconds. Longer periods could be achieved by adding a bit to 
the binary counter with a loss of granularity in selecting the refresh 


The refresh request flip-flop remains set until the arbitrator begins 
a refresh cycle. The memory controller uses CAS-before-RAS 
refresh so no refresh address counter is required. 


Arbitrator 


The arbitrator accepts requests for memory cycles from the system 
processor, I/O channel, or the refresh flip-flop. If the memory is 
idle, the arbitrator goes to the state appropriate for the requestor 
and proceeds through the states of the cycle as shown in Figure 2. 









REFSTRT 


REFEND 


period. 
REFRESH 
REQUEST | 





PROCESSOR 
OR I/O CHANNEL 
REQUEST 


ADDSEL, 
ACCSTRT 


Figure 2. Memory Access Arbitrator State Graph 





If the memory is not idle, the new request will wait until the current 
cycle is complete. The processor has priority over the I/O channel, 
and the channel has priority over the refresh. Processor or I/O 
channel cycles begin by setting a flip-flop whose output (ADDSEL) 
selects the appropriate address bus and asserting a signal 
(ACCSTRT) to the timer to start the memory access. Refresh 
cycles assert a signal (REFSTRT) that begins the refresh cycle. 
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DRAM precharge timing requirements are not violated by 
adjacent memory cycles because RAS is turned off prior to the end 
of a cycle. After that, the arbitrator returns to the idle state (S0) 
before it can begin another cycle and reassert RAS. At a clock 
frequency of 16 MHz, this pattern ensures a precharge time of at 
least 90 ns. Faster clocks could require unused states to be 
employed as timing buffers. 


Memory Timing Control 


The timing controller begins memory cycles when a pulse is 
received from the arbitrator. 


A refresh cycle causes the CAS signal to be asserted first and then 
all the RAS lines a half cycle later. The refresh cycle isended by the 
arbitrator reaching a state and issuing a pulse (REFEND). 


Processor or I/O memory cycles begin by asserting the RAS line 
that is decoded from the two most significant address bits. The 
following clock sets a flip-flop that causes the address multiplexor 
to switch from the RAS address bits to the CAS. The multiplexor 
control then sets a flip-flop to cause the CAS line to be set a short 
time later. 


Using the DRAM Controller Supermacro 


The multiplexor line also goes to a flip-flop that will be set on the 
next clock and causes both the arbitrator and the DRAM control 
flip-flops to end the memory cycle. 


The timing controller may require more customization than the 
other modules in the macro to integrate it into other systems. 
Faster clocks require timing adjustments to the memory control 
signals. These adjustments can be made with additional flip-flops, 
clocked on either edge of the clock, to minimize the length of the 
cycle without violating any timing constraints. 














Using the DMA 


Controller Supermacro 





Introduction 


DMA controllers are used to supply main memory addresses and 

to handle transfers for 1/О channels in systems. They are 
frequently implemented with standard VLSI controllers, but they 
may not be adapted easily to some systems. Discrete logic offers a 
custom solution, but it is expensive in terms of the number of 
devices required. 


Actel offers a field programmable gate array (FPGA) solution with 
a DMA controller supermacro. The supermacro can be customized 
to accommodate any system, and can be combined with other logic 
in the FPGA to integrate other system functions. 
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Applications 
Note 


This note explains the DMA controller design and its use to control 
the flow of data between an I/O channel and memory in a 
processor-based system. 


Architecture 


The architecture of the controller is shown in Figure 1. It consists of 
an address counter, a transfer counter, and a state machine 
controller. 
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Figure 1. DMA Controller Supermacro Architecture 





The address counter provides addresses for channel memory 
accesses. The transfer counter counts the number of transfers in a 
block and interrupts the processor when transfers are complete. 
The state machine controls access to the processor data and 


address bus. It also serves as the interface between the system 
memory controller and the I/O channel. Figure 2 shows the state 
machine state graph. 
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Figure 2. DMA Bus Interface State Graph 


Operation 


The processor writes the beginning address of the block transfer to 
the address counter and memory index register. The counter has 20 
bits to access 1 M bit memories. The register has two bits to allow 
for accesses within one of four 1 M words. The counter and register 
may be used with a 4 X 1M word memory configuration, or may be 
modified for any other configuration. 


Loading the address counter/register is a two-step process, since 
the data port on the DMA macro is 16 bits and the counter/register 
is 22 bits. Upper and lower load controls are provided to load each 
part of the address. A single control is used to load the 
twos-complement of the block size to the transfer counter. 


When the channel is ready for a transfer, it asserts a memory 
transfer request (BREQ) to the DMA controller. The state 
machine on the DMA controller performs the bus arbitration cycle 
with the processor by asserting a processor bus request (PBREQ) 
until it receives a processor bus grant (PBGNT). The processor bus 
grant also goes to the I/O channel so it may deassert the original 
request and drive data on the processor bus. 


Next, the state machine requests a memory cycle (CMREQ) from 
the memory controller. When the memory cycle is complete, the 
DMA controller relinquishes the bus and increments the counters 
for the next cycle. The cycle is repeated until the transfer counter 
rolls over (TCRLVR), indicating the end of the block. The rollover 
can be used to interrupt the processor to reload the counters for the 
next transfer. There is also a rollover on the address counter 
(ARLVR), which can be used to interrupt the processor. 











Using the SCSI Interface 


Controller Supermacro 





Introduction 


The rapid success of the SCSI interface as a standard has caused a 
corresponding increase in the use of SCSI controller chips. NCR 
manufactures several popular SCSI controllers. The NCR family of 
controllers features internal FIFOs, parity pass, and a variety of 
system bus interfaces. 


Some of the devices can do multibyte system data bus transfers; 
some are optimized for a standard bus. However, investments in 
software and hardware development, as well as differences in price, 
may preclude the use of these devices in some systems. In such 
cases, a data transfer interface is needed between an eight-bit SCSI 
interface controller and the system bus. 
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Applications 
Note 


This note explains how the Actel SCSI Interface Controller 
supermacro controls the flow of data between the system bus and 
an NCR 53C90B. The design can be adapted easily to a specific 
system. The macro uses about 60% of an ACTTM 1010 and less 
than a third of an ACT 1020. 


Architecture 


The architecture of the supermacro is shown in Figure 1. It consists 
primarily of a double-buffered bidirectional transceiver and a state 
machine. There is also a parity checker, which can be used to 
interrupt a processor when a single-bit parity error is detected. 
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Figure 1. SCSI Interface Controller Macro 


The controller cycles data through, requests the processor for data 
transfers, and passes control signals to the 53C90B to transfer data 
and access internal registers. 


The heart of the design is the state machine, which controls the 
timing of all internal and external control lines. A state graph for 
the state machine appears in Figure 2. 
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Figure 2. SCSI Interface Controller State Graph 
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Operation 


Preparing for Data Transfer 


The processor accesses the 53C90B internal registers by first 
writing the appropriate data to the controller, then reading or 
writing the 53C90B. 


The processor prepares the controller to transfer data by writing 
data to it that tells it the transfer direction and tells it to begin when 
the 53C90B controller asserts DREQ. 


Writing 


The processor writes a 32-bit word with four parity bits to the buffer 
and prepares the system as detailed above. When the controller 
gets a DREQ signal from the 53C90B, it transfers the data in the 
processor-side buffer to the SCSI-side buffer, then requests the 
processor for more data. It then multiplexes each byte with its 
parity bit out to the controller data bus and issues an acknowledge 
to the 53C90B. 


If the request for data from the processor has been granted by the 
time the last byte is sent, the process will repeat, until the 53C90B 
stops requesting data. 


Using the SCSI Interface Controller Supermacro 


If the 53C90B stops asserting DR EQ, the state machine will wait 
until the signal returns before sending the next byte. If the transfer 
is of an odd number of bytes, or if the transfer stops on an odd-byte 
boundary due to a failure, the processor can initialize the state 
machine by activating the controller chip-select line. 


Reading 


The flow of data from the controller to the system bus begins when 
the 53C90B asserts DREQ. The controller latches each byte of a 
word until the buffer on the 53C90B side is full. Then the controller 
transfers the word to the other buffer, sets the data transfer request 
flag, and resumes latching bytes. 


If the processor has responded to the data transfer request by the 
time the next word is latched, then the process continues. If not, the 
state machine will stop until the buffer is read. 


Parity Error Detection 


The parity error detection logic checks each byte that passes 
through when the DACK signal is asserted. If an error is detected, it 
sets a flag that could be used to interrupt the processor. The flag 
can only be cleared by the processor accessing the controller. 











UART Design 





Introduction 


This applications note demonstrates the design of a Universal 
Asynchronous Receiver and Transmitter (UART) circuit on an 
Actel field programmable gate array (FPGA). The architecture of 
ACT™ 1010/1020 arrays provides the flexibility needed for the 
circuit design. Actel macros are used to speed and simplify the 
design process. 


UART Description 


The UART is a serial communication device that handles 
transmission and reception of information between computers and 





Applications 
Note 


peripherals (modems, terminals, printers, etc). This note 
describes a general-purpose UART that utilizes 189 logic modules 
(approximately 700 gates) in the ACT 1010. 


Figure 1 shows the UART configured as a controller between a 
computer and a peripheral device. Communications protocol is a 
standard RS232 serial interface. 


The UART transmission format is shown in Figure 2. A start bit 
begins the transmission, followed by the data bits, and then an 
optional even or odd parity bit. A programmed number of stop bits 
concludes the transmission. 








Figure 1. UART System Block Diagram 


MARK 





Figure 2. UART Transmission Format 
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UART Design 


The UART application described in this note operates similar to an 
Intel 8251 operating asynchronously. See Figure 3 below. 
When the CPU sends a data character, the UART automatically 


adds a start bit, followed by 7 (or 8) data bits, an even or odd parity 
bit, and a stop bit. The UART then transmits this “packet” as a 
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serial data stream on the TXD output. It shifts out serial data at a 
rate equal to 1/16 the clock frequency (pin CLK16X). 


When the CPU receives a data character, a falling edge on the 
UART's RXD input triggers the beginning of a start bit. The start 
bit is again checked to prevent a false start. If the start bit is valid, 
the data character is shifted in, followed by one stop bit. A serial 
stream of data is then converted to parallel format, ready for CPU 


retrieval. 


TXD 
(Transmit Data) 
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Figure 3. UART Function 
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Transmitter Design 


Figure 4 shows the transmitter's buffer and shift register. 
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Figure 4. Transmitter Buffer and Shift Register 
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The transmitter first latches the parallel data word 120-127 from the 
CPU in the buffer TLATCHO-TLATCHT. It then transfers the 
data character to the shift register TSHIFTO-TSHIFT7 through 
the B inputs. After the first clock (TXCLK), TXDONE goes low 
selecting the A inputs of TSHIFTO-TSHIFT7. TXCLK then 
serially shifts out the data character. A series of AND gates 
(TGATE0-TGATE6) detect a logic high, indicating the stop bit, 
propagating through the shift register. 


Transmitter Buffer 


The transmitter buffer TLATCHO-TLATCHT uses transparent 
latches to store a data character. The level-sensitive transparent 
latch consumes only one ACT 1 logic module. 


Because of the wide variety of latches available in the ACT 1 
library, designers may build efficient registers for any application: 
synchronous clear, gate-enable, active high or low outputs, and 
active high or low gates. 


Clocks 


It is also possible to construct a two-phase clock system by 
combining active high gate latches and active low gate latches. See 
Figure 5 below. 
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Figure 5. Two-Phase Clock Using Transparent Latches 
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The ACT 1 macro library includes a large selection of flip-flops, The clock enable feature allows separation of data storage and 
which are edge-sensitive storage elements. These flip-flops are transfer while maintaining a common clock system. See Figure 6 
available with asynchronous clear, asynchronous preset, and clock below. 

enable inputs. Negation bubbles are available on all inputs and 

outputs. 





Primary Clock (CLK) 





| Desired Second Clock | | 


























Enable 
Data In 
Data Out 
Figure 6. Clock Enable Function 
Transmitter Shift Register The MUX flip-flop uses two logic modules (as does a standard 
Ў . . Е flip-flop). It is thus possible to construct а parallel load/serial shift 
The transmitter's shift register TSHIFTO-TSHIFT7 uses flip-flops register without using additional logic. 
with a built-in 2:1 multiplexor (МОХ) on the D input. The built-in 
МОХ can emulate two-input logic functions, as shown in Figure 7. 
GND A 
— a ° 
5 DFM 
> CLK 
A A 
B Y D Q — Vop B Q 
DF1 DFM 
> CLK 
[^ CLK 


Figure 7. Reducing Logic Levels with MUX Flip-Flops 
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Transmitter Clock Circult An external clock signal (CLK16X) drives the state machine. Gates 
G13 and G14 decode the outputs Q3-Q0 of the state machine and 
The clock circuit is shown in Figure 8. It is implemented as a 4-bit generate an internal clock (TXCLK) for the transmitter shift 
Gray code counter with both synchronous and asynchronous reset register. Only one bit changes at a time, eliminating glitches on the 
(COUNT and RESET, respectively) and count enable (GOTX). transmitter clock (TXCLK). 
CLK16X 


TXRDY 






Q3 
B-A 


NAND3A 


AND3B D~ 
CNTPRESET 
G15 


Figure 8. Transmitter Clock Circuit 
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Two-Level Logic Decoders 

The transmitter state machine takes advantage of the OR/AND Decoder functions require signals and their complements. Using 
macro, a two-level logic function using only one logic module. In negation assertion levels available on Actel macros simplifies the 
combinatorial logic, use of two-level logic functions can logic path and conserves module usage. See the 2:4 decoder in 
significantly reduce the module count. There are different types of Figure 9. 


OR/AND macros with a wide choice of input combinations. Refer 
tothe ACT 1 family data sheet for a list of OR/AND and AND/OR 
macros. 


It is important to note that propagation delay through a logic 
module is independent of the implemented function. Thus the 
designer should use negation bubbles and two-level logic macros 
whenever possible. 








Circuit with Actel Macros 





Standard Circuit 


Figure 9. 2:4 Decoders 


Parity Generation From the shift register, data serially enters the parity generator on 


А : ; the transmit line (TXDI). The generator inserts the parity bit just 
The transmitter parity generator is programmable for odd or even x : A 
ity. Th : ircuit is sh Aa | prior to the stop bit. A 2:1 МОХ (PMUX) selects either data or 
parity. The parity generator circuit is shown in Figure 10. parity bit to the data transmit line TXD. 
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Figure 10. UART Parity Generator 
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Logic Module Conservation 


Efficient utilization of Actel macros allows implementation of the 
UART transmitter section using only 80 of the 295 available logic 
modules on the ACT 1010 array. The parity generator uses the 
XOR/AND macro. The XOR/AND, XOR/OR, and AND/XOR 
functions each utilize only one logic module. 





ACT 1 devices are particularly efficient at implementing 
multiplexor functions. Both 2:1 and 4:1 MUXs can be implemented 
in only one logic module each. MUXs implement internal bus 
designs in place of internal three-state drivers on ACT 1 devices, as 
shown in Figure 11. 





Figure 11. Internal Three-State Function Implemented with ACT 1 Multiplexor 


Receiver Design 


Shift Register and Buffer 


Figure 12 on the next page shows the schematic of the receiver's 
shift register and buffer. A logical "1" is loaded into the receiver 
shift register RSHIFTO-RSHIFT9. After the first clock 
(SHFTRX), XFERRX goes low, selecting the A inputs of the 
MUX flip-flops. The lowstart bit precedes the serial data shifts into 
the register. When a complete serial data character shifts into the 





register, the parallel data character transfers to the buffer 
(RLATCHO-RLATCH?) for CPU access. 


The receiver design takes advantage of the single-module 
transparent latch with MUX, minimizing required circuitry. 
Similarly, the built-in MUX can emulate other two-variable 
functions. 


The MUX also can emulate an inverter, effecting a negation at the 
data input of the transparent latch. 
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Figure 12. Receiver Shift Register and Buffer 
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Recelver Clock Circult The state machine has both synchronous and asynchronous resets 
The receiver clock circuit is a 4-bit Gray code counter (similar to the (RXCOUNT RESET): A ilter сеш. (RXDFILTER) prevents 
: тауа ерх false starts due to errors on the data line (RXDI). The receiver 
transmitter clock circuit) and is driven by the same external clock clock (SHFTRX) is decoded from the outputs of the state machi 
signal (CLK16X). See Figure 13. їр ו‎ 
(Q0-Q3). 
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Figure 13. Recelver Clock Circuit 
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Parity Checker serial data stream and compares it to the incoming parity bit. 
The parity error flag (PARITYERROR) is set accordingly. See 
Figure 14. 


The receiver parity checker is programmable to either odd or even 
parity. The parity checker generates a parity bit from the incoming 

















PARITYERROR 
Figure 14. Parity Error Generator 
Design Complexity Table 1. Logic Utilization 
The UART transmitter section can be implemented using only 80 Transmitter Logic Modules 
of the 295 available logic modules on the ACT 1010 device. The Buffer and Shift Register 45 
Teceiver section requires only 78 logic modules. 
Clock Circuit 26 


These figures represent 27.1% utilization of the array for the 
transmitter section and 26.4% utilization of the ACT 1010 for the Parity Generator 9 
receiver section. 








Total 80 
Table 1 provides a breakdown of logic utilization for the UART Recel 
design on Actel’s FPGA. өсөүөг Logic Modules 
Receiver Shift Register and Buffer 42 
Receiver Clock 30 
Recelver Parity Checker 6 
Total 78 
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Actel TA181 ALU 





Introduction 


The venerable TTL 74181 is a well-known and useful arithmetic 
logic unit (ALU). Its four select lines, mode, and carry-in bits 
combine to offer over 40 unique arithmetical and logical functions. 
Many designers dislike losing such functionality as they move from 
discrete to ASIC technology. 


Designers may retain the function of the 74181 using Actel field 
programmable gate arrays. Actel’s macro library includes “TA 181," 
a soft macro functionally identical to the TTL component. 


Implementation 


The 74181 depicted in popular TTL data manuals has six logic 
levels (seven for A=B output). While this representation may not 


S2 S3 


Applications 
Note 


be accurate at the transistor level, it typifies an ASIC design 
implementation. 


The Actel TA181 design requires only four logic levels (five for 
A = B output). It uses components available from Actel's library. 
Actel logic module flexibility enables such "logic compression" by 
implementing a wide variety of logic functions. 


For example, each bit pair in the ALU passes through a section of 
logic, where select lines qualify the pair before outputting them to 
the remaining logic. The qualification logic, as it appears in TTL 
manuals, is shown in Figure 1. It uses eight gates and three logic 
levels. 





Figure 1. TTL 74181 Qualification Logic 
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Figure 2 shows Actel's version of qualification logic. By contrast, it 
requires only two modules and one level of delay. The driving gate 
is an OR function, as opposed to NOR in the TTL diagram. 
Bubbling (inverting) inputs on each gate driven by the section 
adjusts for this difference. All gates in the Actel library have 
versions with bubbled inputs. 


One module implements two three-input ANDs to two-input OR 
(AO4A), and the other module implements two two-input ANDs 
to three-input OR (AOSA). Combined, these modules create the 
qualification logic. 


The remaining logic in the TA181 macro is similar to that found in 
the TTL 74181, with some improvements. M inputs require no 
inverter, because M-driven gates have bubbled inputs. Single logic 
module macros implement the AND-OR functions. 


Another example of logic compression is in macro OAS, used in the 
logic generating the ЕЗ output. The macro implements the 
expression: 

Y = /(/A*/B*/C) + (A*/D)) 


in a single module. OAS is a clear example of the savings in delay 
and modules achievable with the versatile Actel logic module. 


Conclusion 


Actel's ALU 181 design uses only 32 logic modules, only about 10% 
of an ACT™ 1010 gate array. It is an excellent example of the Actel 
logic module’s adaptability, both in implementing logic functions 
and in compressing logic, for efficient, high-performance designs. 











Figure 2. Actel’s TA181 Qualification Logic 
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Introduction 


Adders, in a variety of sizes, perform a very common function in 
digital designs. They are frequently implemented using 
carry-look-ahead (CLA) logic to speed carry propagation. 
However, as adder size increases, CLA logic requires increasingly 
wider gates and more logic levels; such requirements degrade adder 
performance. 


Implementation 


Actel solved the CLA logic problem by designing its family of fast 
adders using the Carry Select (CS) method. CS adders partition the 
addition into groups of bits. Two additions are performed 
simultaneously for all the groups. One addition is done with a 
carry-in of one and the other with a carry-in of zero. The two group 
sums and their carry-outs are connected to two-input multiplexors 
(MUXs). 

When determined, the carry from the lower groups selects the 
group that supplies the sum to the outputs. The carry-out for the 
group selects the sum and carry for the next higher group, and so 
on. 


The speed of group additions and carry propagation determine the 
speed of the CS adder. 


Applications 
Note 


Group Size 


Since carries ripple within groups, small group size is an advantage. 
Small groups, however, require more groups for a given number of 
bits, lengthening the carry propagation path. Thus, the design 
trade-off is the number of delays to a group sum versus the number 
of delays required to propagate a carry. 


By optimizing group sizes based on adder macro delay parameters, 
Actel adders are very fast. 


Carry Propagation 


As previously mentioned, carry propagation speed is a limiting 
factor in carry select adder performance. Group carries must be 
selected, just as the sums are. Unlike sums, carries select higher 
carries and so forth. This creates the problem of cascading carries 
through multiple logic levels. 


The flexibility of Actel’s logic module alleviates cascading carry 
problems by implementing a cascaded two-input MUX in a single 
module. The logic is shown in Figure 1. A two-input MUX drives 
the select for another two-input MUX. The macro allows two-level 
carry selection using only one module and in only one delay. All 
Actel adder macros use the cascade MUX macro for rapid carry 
propagation. 





Figure 1. Cascade Multiplexor Macro 
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The Actel Fast Adder Family 


The following table lists Actel adders, their speeds (driving one 
load), and the number of logic modules used: 











Macro Name Number of Bits Typical Delay Levels of Logic Number of Modules 
(ns) 
FADD8 8 36 4 37 
FADD12 12 45 5 63 
FADD16 16 47 5 78 
FADD24 24 56 6 120 
FADD32 32 65 7 160 





Eight-Bit Twos 


Complement Multiplier 





Introduction 


Multipliers are vital to systems design. In the past, designers 
integrated VLSI multiplier chips into their designs. As ASICs 
become more popular, ASIC-based systems increasingly utilize 
multiplier macros. Previously, field programmable devices did not 
have sufficient architectural flexibility and connectivity to 
efficiently implement multipliers. Actel has designed an eight-bit 
twos complement multiplier that easily fits on an ACT?M 1010 or 
ACT 1020 field programmable gate array (FPGA). 


Algorithm 


High-performance multipliers form the product of two numbers 
using combinatorial logic and an array of adders. Actel’s multiplier 
is based on the Baugh-Wooley! algorithm (Equation 1), and uses 


Applications 
Note 


conventional full adders for twos complement multiplication. 
When the size of the multiplier and multiplicand are known, the 
Baugh-Wooley formula can generate a series of product terms. In 
this example, when both multiplier and multiplicand are eight bits, 
M and N equal 8 in the formula. 


Each product term contains “2” raised to a power. The product 
terms are arranged into an array according to their binary power. 
Adding the columns of the array results in the product of the two 
original operands. 


Inthe generic architecture, AND gates generate the product terms, 
and the adders sum them together. The adder array has a 
parallelogram shape. Dividing the parallelogram into four sections 
eases the design process. Each section generates a set of partial 
products, which are then added together. 


P= a,b 2n + X ab2iti 
m-1 Dn-1 i-o 2o ^^ 


n-2 
+2 1-25 + 22-1 Fa, 2714 ал +È inb? Ü 
1= 


m-2 


A2 C25 + 2"7 + bua 24 bia +2 абы?!) 


Equation 1. Baugh-Wooley Algorithm 





Actel Multiplier Macro Architecture generators have similar structure, the design was copied and 
modified to create the others. 

The Actel multiplier macro architecture appears in Figure 1. It 
consists of three major sections: the partial product generators, 


Wallace tree adder, and final product adder. 


The Wallace tree adder sums the partial products and passes two 
binary numbers on to the final product adder. The final product 
adder performs arithmetic necessary to produce the product. The 
The partial product generators implement the adder array sections following is a discussion of each element in detail. 


using Actel library components. Since the partial product 
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Figure 1. Actel Multiplier Macro Architecture 





Partial Product Generator 


Each partial product generator requires as input four bits from 
each operand. The bits pass through AND functions, which output 
through an array of adders to generate eight partial products. 
Because the partial product generators operate in parallel, all 
partial products are produced simultaneously. 


The schematic has an array of adders with inputs either from an 
AND gate or the sum from another adder. The Actel library adder 
FA2A adds a two-input AND term with inverted inputs to another 
input and a carry-in. Most of the adders in the generator use FA2A, 
requiring only two logic modules to implement both the AND and 
full adder functions. 


Wallace Tree Adder 


The multiplier's Wallace tree section sums columns of terms from 
partial product generators. Summing a column in binary arithmetic 
requires tracking carries generated to the higher power-of-two 
positions in the number. The Wallace tree uses three-bit adders to 
input up to three bits of the column. It feeds the adder sums into 
adders taking in other bits of the column, and feeds carry-outs to 
the inputs of adders summing the column in the next higher bit 
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position, and so on. The tree output is a series of sum and carry-in 
bits which may be added in a conventional adder as two binary 
numbers, producing the final product. 


Because both carry-outs and carry-ins are active-low on the Actel 
adder macros, the adders can be used in a Wallace tree structure as 
a three-bit adder which outputs a sum and a carry. The carry-out 
from an adder in one column inputs directly to the carry-in of the 
next higher column. Other than the adders, no logic is required to 
design the Wallace tree. 


Final Product Adder 


The multiplier's last section is a binary adder. The Wallace tree 
completely sums some lower columns of bits, which pass from the 
tree to the product bus. The tree outputs higher bits as two binary 
numbers to be added. The bits comprise the sum bit for each bit 
position and its respective carry-in. The final adder adds them 
together for the remaining bits of the product. 


The final adder uses the same design technique as the Actel family 
of fast adder macros. The architecture of the final product adder is 
shown in Figure 2. 


Elght-Bit Twos Complement Multiplier 








Multiplexors 


Sum Bus 


Figure 2. Final Product Adder Architecture 








The Carry-Look-Ahead Problem 


Adders are usually implemented using carry-look-ahead (CLA) 
logic to speed carry propagation. However, as the size of the adder 
increases, the CLA logic requires wider gates and more logic levels; 
such requirements degrade adder performance. 


Carry Select Addition 


To solve the CLA problem, Actel designs its fast adders using the 
Carry Select (CS) method. CS adders partition the addition into 
groups of two or three bits. Two additions occur simultaneously for 
all groups. One addition has a carry-in of one and the other a 
carry-in of zero. The two group sums and their carry-outs connect 
to two-input multiplexors. 


When determined, the carry from the lower groups selects the 
supply that sums the outputs. Next, the carry-out for the group 
selects the sum and carry for the next higher group, and so on. 


The CS adder speed is determined by group additions speed and 
carry propagation speed. 


Group Size 


Since carries ripple within groups, small group size is an advantage. 
Small groups, however, require more groups for a given number of 
bits, lengthening the carry propagation path. Thus, the design 
trade-off is the number of delays to a group sum versus the number 
of delays required to propagate a carry. 


By optimizing group sizes based on adder macro delay parameters, 
Actel adders are very fast. 
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Carry Propagation 


As previously mentioned, carry propagation speed is a limiting 
factor in carry select adder performance. Group carries must be 
selected, just as the sums are. Unlike sums, carries select higher 
carries and so forth. This creates the problem of cascading carries 
through multiple logic levels. 


The flexibility of Actel's logic module alleviates cascading carry 
problems by implementing a cascaded two-input MUX in a single 
module. The logic is shown in Figure 3. A two-input MUX drives 
the select for another two-input MUX. The macro allows two-level 
carry selection using only one module and in only one delay. All 
Actel adder macros use the cascade MUX macro for rapid carry 
propagation. 


Figure 3. Cascade Multiplexor Macro 


The Actel Advantage 


The eight-bit twos complement multiplier illustrates the type of 
macros that can be implemented on Actel field programmable gate 
arrays. The design method detailed here suits multipliers of many 
different configurations. This technique also applies to 3X7, 9X4, 
or 6X5 multipliers, depending on system requirements. 


Other multiplier algorithms also apply. A magnitude array 
multiplier design using the Cavanagh method! requires less logic 
than the eight-bit multiplier above. Other alternatives include the 
Booth and Bit-Pair Recording methods’. 


The multiplier macro may be designed as part of the logic in a field 
programmable gate array. The remaining gate array modules are 
available for other design purposes, either other arithmetic 
functions or glue logic. For example, Actel’s 8X8 multiplier macro 
uses 241 of 546 modules on an ACT 1020. Nearly 60% of the chip is 
still available for other purposes. 


In any design, Actel provides the technology to complete the entire 

gate array development process including schematic capture, 

simulation, place and route, timing analysis, and device 

programming. 

1. Cavanagh, Digital Computer Arithmetic Design and 
Implementation, McGraw-Hill, New York. 1984. 
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Analyze FPLD Architectures, 
Performance and Development 
Tools to Optimize Design- 
Dependent Selection 


by Dennis McCarty, Sr. Applications Engineer 
Actel Corporation, Sunnyvale, California 


Os the past decade, field- 
programmable logic devices 
(FPLDs) have undergone dramatic 
changes. The devices of the late 
1970s and early 1980s primarily con- 
sisted of fuse-programmable PALs, 
pioneered by Monolithic Memories 
Inc. (now part of Advanced Micro 
Devices), and a handful of alternative 
versions from competitive manufac- 
turers. The devices were used by 
circuit designers to integrate discrete 
TTL logic into higher density pack- 
ages. The designer's goal was to take 
four or five discrete logic chips and 
replace them with one. And while 
the existing device architectures were 
not that sophisticated, that modest 
goal was easily accomplished using 
fairly inexpensive devices and pro- 
gramming hardware. 

Consider, for example, the device 
choices available to the designers of 
ten years ago. Clearly, bipolar was 
the most prevalent technology in use 
with devices being "hard" pro- 
grammed by blowing some sort of a 
fuse — PALs, FPLAs and alterna- 
tives were good examples of that 
genre. The differences between com- 
peting devices related to the number 
of I/O, the number of product terms 
and packaging options. 

Today, FPLD density levels have 
increased as have the number and 
types of devices that compete in the 
logic-integration arena. Indeed, it's 
common to find a dozen FPLDs on 
the market today that can easily 


accommodate the functionality of 
20 random TTL logic devices. And 
CMOS has replaced bipolar as the 
dominant technology of choice for 
FPLDs. 

Perhaps the biggest change has 
taken place in the area of device 
programming and the enabling tech- 
nology. While conventional bipolar 
fuse technology continues in use 
today, EPROM, EEPROM, static 
RAM (SRAM) and antifuse pro- 
gramming technologies have quickly 
been adopted by the majority of new 
FPLD companies entering the in- 
dustry. And device programmers have 
been replaced with sophisticated 
workstation-based development en- 
vironments that combine powerful 
new hardware and software that 
speeds the design time for complex 
circuit designs. 

But as the complexity of the new 
breed of FPLDs has increased for 
designers, so has the difficulty in 
determining the suitability of the 
various devices for particular ap- 
plications. Moreover, with new man- 
ufacturers emerging each year with 
devices of different architectures and 
design methodologies, the 1990s sys- 
tem designer’s selection task is likely 
to become more difficult as the 
decade progresses. Even device names 
have gotten more sophisticated and, 
as a result, more confusing. Today, 
the terms PLD, EPLD, FPGA, 
PGA, LCA, PMD, PAL, PLA, 
EEPLD as well as others are all 








generally being used to refer to 
devices that fall into the FPLD 
category. 

Interestingly, once armed with the 
proper set of criteria, system design- 
ers can accurately and safely select 
preferential devices. The guidelines 
that follow take aim at achieving 
that goal for designers. Specifically, 
the criteria relates to comparing 
device technologies, understanding 
density issues and evaluating develop- 
ment tools. 


Comparing Device 
Technologies is Key 
Because memory-based technol- 
ogies are now in use, the question of 
device volatility becomes an issue. 
Specifically, the volatility of the pro- 
gramming element used to intercon- 
nect the logic on device determines 
whether programming is permanent 
or must be performed each time 
power is applied. One-time program- 
mable devices use fuses or antifuses. 
Reprogrammable devices are either 
electrically reconfigurable(EEPROM 
or SRAM technology), or optically 
reconfigurable(EPROM technology). 

Only the SRAM technology is 
volatile in that, if power is inter- 
rupted, the device must be reloaded. 
SRAM or EEPROM devices must 
be used if the design requires that the 
device be dynamically reconfigured 
in operation. 

If reconfiguration is not required, 
then the advantages each technology 
offers need to be considered. Some 
designers are attracted to reprogram- 
mability during the development 
cycle because it allows them to try 
design iterations without wasting 
components. That is aconsideration, 
but it must also be noted that there is 
a design time and production cost 
penalty for the software and hard- 
ware support required to boot the 
device. As always, when evaluating 
systems, all the relevant costs should 
be considered. For example, losing 
parts to design iterations is not nearly 
as costly as placement and routing 
(P&R) that require manual inter- 
vention. 





In certain applications, such as 
military, nonvolatility is preferred. 
In other applications, such as 
aerospace/satellite environments, 
radiation-tolerant nonvolatile tech- 
nology may be the only option due 
to the threat of a loss of program 
information from radiation. The 
vendors of devices being considered 
for use in a hostile environment 
should be asked what conditions 
could adversely affect their products. 

Beyond device volatility, perform- 
ance becomes another elementin the 
selection criteria. But how is 1t pos- 
sible to determine performance and 
functionality without testing or, at 
least, simulating the device? Actually, 
the performance a design achieves 
depends on how it is implemented 
within a device. A qualified estimate 
of how the design might be imple- 
mented usually can be made once 
the architecture of the device is 
understood. But while some manu- 
facturers claim that their device per- 
formance is entirely predictable, es- 
timates are valid only insofar as the 
P&R performs as expected. It may 
be easy to anticipate how the FPLD 
software will implement a state 
machine orcounter. It is quite another 
thing to guess how it will accom- 
modate an adder or other large 
combinatorial function. 

The maximum clock frequency is 
often determined by the number of 
logic levels required between flip- 
flops, the pad-to-flip-flop time, or 
the flip-flop-to-pad time. The last 
two times may be found in the device 
data sheet. Estimating the number 
of levels of logic in the longest path 
in a design requires more thought. 
The number of terms and/or vari- 
ables driving the input to a flip-flop 
is one consideration. 

Devices such as PALs, that have 
AND-OR array feeding flip-flops, 
can implement terms containing large 
numbers of variables as easily as 
terms containing only one. Other 
devices have a set number of vari- 
ables that its logic elements can 
accept. Terms that exceed that num- 
ber require more than one element, 


causing elements to be cascaded to 
implement the logic. Cascading logic 
increases the delay path. Moreover, 
some AND-OR array devices have a 
fixed number ofterms in their arrays. 
If this number is exceeded, the extra 
terms must be implemented else- 
where and brought to the array 
plane. Bringing in terms from out- 
side the array causes additional delay. 

Routing delays are another con- 
sideration. Devices that fall into the 
FPGA (field-programmable gate 
array) camp have delays that increase 
with fanout. The amount of ad- 
ditional delay due to fanout is pub- 
lished by the manufacturers in tab- 
ular or equation format. Other, more 
fixed architectures like PALs, don’t 
exhibit fanout-dependent delays, but 
have the problem of fixed routing 
paths whose delays cannot be im- 
proved. 

In either case, however, post P&R 
timing analysis is required to deter- 
mine the anticipated delays. For 
example, a path for one application 
might pass through blocks that were 
not needed for a different circuit, 
thus adding path delays. Thus, de- 
signers need to carefully anticipate 
path differences and resultant delays 
for each application. 


The Path to a Masked 
Device 

While FPLD devices have proven 
themselves as cost-effective produc- 
tion vehicles in all but extremely 
high volume applications (greater 
than 10,000 units per month), many 
designers are taking advantage of 
the FPLD’s fast device turnaround 
times and are employing them as 
prototyping vehicles as well. Indeed, 
it is often easier and faster to develop 
an FPGA for a prototyping environ- 
ment than it is to implement the 
equivalent system design using dis- 
crete logic devices on a printed circuit 
board. As such, many designers today 
use programmable devices for design 
prototyping before migrating the 
design to a masked array or standard 
cell for production. Use of an FPLD 
as a temporary solution is an attrac- 
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tive alternative to the time and ex- 
pense of developing a masked device. 
The programmable device may even 
be used to sustain production until 
demand for the product justifies the 
masked solution. This strategy can 
effectively shorten product introduc- 
tion time by months. 

The ease with which the design 
may be migrated from one tech- 
nology to another determines the 
risk, cost, and time involved in mak- 
ing the transfer. In general, the closer 
the architecture of the device to that 
of a masked device, the easier the 
migration path. Some gate array 
vendors have software that converts 
netlists of programmed devices to 
masks. The impact that the choice of 
a programmable device would have 
оп a conversion should be considered 
as early in the life cycle of the 
product as possible. 


The Gate-Density 
Confusion 

Perhaps no other criterion has 
created as much controversy as has 
the issue of device density of FPLDs. 
Considering the difficulty of compar- 
ing different technologies, the issue 
is understandable. Indeed, accurate 
assessments of gate densities is crucial 
in not only design but also in com- 
parative cost analysis. 

Because FPLD device manufac- 
turers often use conventional masked 
gate arrays and related density mea- 
surements as points of comparison, 
any similarities that they can make 
are often drawn, and the resulting 
marketing hype adds to the confu- 
sion. Specifically, many FPLD man- 
ufacturers equate the density of their 
devices in terms of “number of gates.” 
Unfortunately, manufacturers all 
don’t count gates the same way. 
Other FPLD manufacturers don't 
refer to gates at all. As a result, 
FPLD device comparisons using 
density measurements are error 
prone. 

The problem doesn’t exist when 
comparing conventional gate arrays. 
Masked gate array vendors specify 
their devices as having a certain 














number of two-input NAND gates 
and the percentage that can be actu- 
ally utilized in a design. With masked 
arrays, higher-input gates and other 
logic functions are made from speci- 
fied numbers of these two-input 
gates. Such comparisons cannot be 
made with FPLDs because each 
manufacturer uses a unique archi- 
tecture and functional logic module, 
or block within the device to imple- 
ment various circuit functions. In- 
deed, it is very difficult for designers 
who are used to designing with 
conventional logic, and even those 
who have had experience with masked 
gate arrays, to understand what they 
can do with a more complex type of 
logical element or module. Clearly, 
designers need a more accurate way 
to gauge FPLD device densities. 


Only Two Basic Device 
Types 

The first step toward achieving 
that goal is to understand that 
despite what vendors claim, there 
are only two basic types that fall into 
the FPLD category — programmable 
logic devices (PLDs) and field- 
programmable gate arrays (FPGAs). 
Understanding which devices are 
one or the other is critical to proper 
device selection. For example, PLDs 
have a fixed interconnect architec- 
ture, comprise AND-OR array planes 
that feed flip-flops and usually ex- 
hibit static power dissipation. De- 
vices in this category include those 
with labels such as PALs, FPLAs, 
EPLDs and EEPLDs. 

The distinguishing characteristics 
among the different vendors, aside 
from programming specifics (fuse, 
EPROM, etc), is the number of flip- 
flops that the device contains. An 
FPGA, on the other hand, has a 
flexible interconnect technology with 
no fixed AND-OR plane, but does 
incorporate an array of logical build- 
ing blocks that are often structured 
loosely around a masked gate array 
architecture. PGAs and LCAs are 
names some manufacturers use to 
describe their specific type of FPGAs. 

It’s important for designers to 


keep in mind that there are areas of 
similarities between FPGAs and 
PLDs that often add to the confu- 
sion. For example, both categories 
of devices incorporate combinatorial 
logic that feed other logical devices 
(flip-flops, gates, etc). The combina- 
torial logic that exists with FPGAs, 
however, is more flexible and more 
easily configurable than that within 
the typical PLD. In addition, the 
FPGA’s combinatorial logic is in 
more of an array configuration, so it 
is interconnected differently. 


The Elusive “Equivalent 
Gate” 

Once the PLD/ FPGA distinction 
is made, the second step in gauging 
FPLD device densities involves un- 
derstanding device architectures to 
determine capacities rather than rely- 
ing on manufacturers’ stated density 
claims. To be specific, the question 
of gate densities or capacities with 
either category can be reduced to 
analyzing the amount of logic that 
can be realized from the logic build- 
ing blocks in the device, and the 
amount of blocks on the device that 
actually can be used in a design. 

To facilitate the analysis, the con- 
cepts of Gate Array Equivalent Gates 
and PLD/LCA Equivalent Gates 
needs to be defined. A gate array 
equivalent gate is the two-input 
NAND gate mentioned earlier and 
can be considered as a real or usable 
gate. 

A PLD Equivalent Gate corres- 
ponds to the number of gates that 
are assigned to the device by the 
manufacturer. The number may re- 
flect the total number of gates on the 
device, or it may be based on some 
assumptions about the effective use 
of gates. Each building block on an 
FPGA contains an inherent number 
of gates. The way the building block 
is used in a design, as well as the 
architecture of the device itself, de- 
termines how many of the gates can 
actually be used to implement logic. 
The distinction doesn’t arise with 
masked gate arrays because the low- 
est level of logic is the gate itself. 


When the level of granularity of the 
logic element is higher, as it is in all 
PLDs and FPGAs, the question of 
how many of the gates on the device 
can actually be used is important. 


One Gate, Two Gates, 
Three Gates, . . . 

With this understanding іп тіпа 
and armed with information derived 
from material supplied by manufac- 
turers,consider the counting methods 
and derived device densities of two 
representative FPGA vendors - Actel 
and Xilinx -and PLD supplier, Altera. 

Actel determines the gate capacities 
of its FPGAs by comparing actual 
designs and library macros with the 
same logic implemented on a masked 
gate array. Actel’s ACT™ 1 archi- 
tecture is similar to a channeled gate 
array in nature. Because the gate 
counts for masked gate array designs 
are easily determinable, Actel uses 
the masked arrays as a convenient 
standard to characterize its devices 
in gate-array equivalent gates. The 
Actel architecture consists of an 
array of logic modules with the rows 
and columns separated by routing 
tracks. While it is more complex 
than a gate, the Actel logic module is 
much simpler than the basic element 
of competing architectures. Based 
on comparisons to masked gate 
arrays and analysis of actual designs 
over the past two years, in practice 
Actel’s module is equivalent to 3.2 
gates. 

The FPGAs from Xilinx, mean- 
while, are based on a Configurable 
Logic Block (CLB) architecture and 
the vendor counts gates using another 
technique. Each CLB contains two 
dedicated flip-flops and a combina- 
torial function generator. The gener- 
ator is a static RAM that is loaded 
during the configuration process. 
Five inputs are used to address the 
SRAM so that it may implement 
any five-variable function or two 
four-variable functions with some 
restrictions. 

Xilinx 1/O Blocks (IOBs) contain 
an input latch, an output flip-flop 
and a three-state output buffer. When 
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Xilinx calculates the densities of its 
devices, the company counts all the 
gates in all the IOBs on a device as 
usable gates. The total number of 
gates in an IOB is said to be 13. 
Xilinx determines the density of 
their devices by multiplying the 
number of CLBs and IOBs in the 
device by the number of gates they 
are claimed to represent. The func- 
tion generator on the CLB is evalu- 
ated as two four-input generators. 
Each is specified as having a maxi- 
mum gate count of between 20 to 22 
gates and a minimum of one. There 
would thus be between 2 and 44 
gates per CLB function generator. 
The vendor then assigns an “average 
number of gates” to the function 
generator based on an assumed 50 
percent utilization efficiency. 

The CLB is rated as having a total 
of 42 gates, 30 gates from the function 
generator and 12 gates from the two 
flip-flops. These gates account for 
nearly 88% of the total number of 
gates on the XC3000 series devices. 
After multiplying all possible gates 
in the CLBs and IOBs on the device 
the products are summed to give the 
total number of gates. A fraction of 
the total (between about 50% and 
65%) is given as the “usable gate 
count” by the manufacturer. This 
arbitrary approach is similar to that 
applied to PLD architectures. 

Altera’s MAX architecture repre- 
sents perhaps the most architectural- 
ly sophisticated PLD and the subject 
of this comparison. The largest de- 
vice in that family, the EPM5128, 
incorporates 128 flip-flops (the typical 
PAL has only eight) and a bus 
structure that differs from conven- 
tional PLDs. The firm’s PIA (pro- 
grammable interconnect array) bus 
allows the company to interconnect 
more cells together than convention- 
al PLDs, but the architecture of the 
basic unit that designers must deal 
with has not changed. It is the AND- 
OR macrocell. The architecture of 
the device consists of Logical Array 
Blocks (LABs) that can be intercon- 
nected via the PIA bus. The bus acts 
to connect LAB outputs to other 


LAB inputs. 

Each LAB contains 16 macrocells 
and an expander. The macrocell 
contains an AND-OR structure feed- 
ing into an exclusive OR gate which, 
in turn, drives a flip-flop. There are 
also inverter/ buffers in the macrocell 
to allow for true or complementary 
logic. The expander produces ad- 
ditional terms which may be fed to 
any of the macrocells in its LAB. 
Altera doesn't specify gate counts 
for its device but rather uses macro- 
cells to characterize capacity. 


Trust Designs, Not Vendor 
Claims 

Although the accuracy ofthe gate 
densities derived from the counting 
schemes itemized above will not be 
disputed, designers are well advised 
to disregard manufacturers' claims. 
Instead, by employing the analysis 
techniques and equations that follow, 
system designers will be able to 
accurately gauge competitive device 
densities by understanding the various 
architectural distinctions and by 
analyzing their own designs. Because 
the ultimate goal is to measure the 
number of gates a designer can 
expect to be able to use on an 
FPLD/LCA device rather than bas- 
ing this number on assumptions or 
small-scale macro implementations, 
prior to the evaluation that follows, 
it is interesting to review the results 
of actual designs that were analyzed 
relative to Actel, Xilinx and Altera 
devices as to gate usage. The results 
will serve as a check on the gate- 
counting equations being presented 
laterinthe discussion. The results of 
these tests are open to scrutiny by 
any who request it. 

For Actel, nine customer designs 
were randomly selected and a group 
of TTL macros from the macro 
library for analysis. The designs in- 
cluded a DRAM controller, 32-bit 
accumulator, graphics controller, data 
transfer controller, and a SCSI inter- 
face controller. The TTL macros 
included counters, multiplexors, and 
decoders. Table 1 lists the macros 
analyzed. 
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TABLE OF TTL MACROS 
ANALYZED IN ACTEL AND 
XILINX LIBRARIES 


3-to-8 Decoder 
8-to-1 MUX 
4-to-I MUX 
74161 
74164 
74194 
74377 
74139 










Table 1. 


Four customer designs were simi- 
larly randomly selected for data for 
Xilinx along with an AMD (a Xilinx 
second source) published application 
article and the same TTL macros 
selected from the Actel library. The 
Xilinx designs included a keyboard 
interface controller, an RLL2,7 coder, 
and a video controller. 

Counting gates for Altera devices 
was a different situation. When the 
Altera MAX+PLUS development 
system compiles a design it uses the 
resources on the device to implement 
it, but the details of the implementa- 
tion are hidden. When a macrocell is 
reported by the development system 
as having been used there is no way 
to know how much of its logic was 
used. It could range from a few gates 
to all the logic and the flip-flop in the 
macrocell. What is required to ac- 
curately measure the capacity of the 
device would be designs whose 
schematic gate counts were known 
and, more importantly, that are 
known to have utilized virtually all 
the resources on the chip. 

Fortunately, four such examples 
exist. These are some capacity bench- 
mark studies performed by Altera. 
The four designs were examples of 
common logic circuits that were 
chosen so as not to favor any type of 
architecture over another. The de- 
signs include a Data Path, Timer 
Counter, State Machine and Arith- 
metic functions. The benchmark 
measures how many of each type of 
design could be accommodated by 
the device being tested. Because the 
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number of gates in each of the 
designs is known, as well as the 
number of the designs that could be 
placed on a fully-loaded device, the 
capacity of the device can be found 
by simply multiplying the gates per 
design by the number of designs. 


total. The designs were used to 
verify the formula as a predictive 
tool. 

For consistency, individual logical 
elements were counted (source: LSI 
Logie HCMOS Design Manual, 1986) 
as follows: 


Table 2 shows the gate counts for 
the various Actel designs. The de- 
signs are ordered from the lowest 
number of gates per module to the 
highest as calculated by simply divid- 
ing the total number of gates in the 
design by the total number of mod- 


ules. As may be seen in Graph |, 
there is variation in the utilization in 
the modules with the average being 
about 3.2 gates per module. 


Test Results Analyzed Element Gates 
The designs were analyzed by Inverter .5 
counting the number of gates and Buffer 5 


building blocks in them. The gates Gate, two-input* 1.0 Table 3 shows the gate counts for 
were next categorized according to Gate, three-input 1.5 the various Xilinx designs. Due to 
whether they were combinatorial Gate, four-input 2.0 the large number of flip-flops in the 
gates, flip-flop (i.e. flip-flop or se- Gate, five-input 2.5 Xilinx architecture (there are two 
quential latch) gates, or combinatori- Gate, exclusive-Or 3.0 per LCA which is why the fraction 
al gates associated with a flip-flop. MUX, 2:1 3.0 

The last category refers to combina- MUX, 4:1 6.0 ACTEL 

GRAPH 1. LINEAR MODEL 


torial gates that immediately precede Latch 4.0 
















the D input of a flip-flop in a design. Flip-flop, D 6.0 
The classification is significant be- Flip-flop, JK 10.0 
cause it enabled the equations that Adder, full 10.0 


follow to be developed that allow 
designers to estimate the number of 
gates that could be accommodated 
on a device. The estimate would be 
based on the number of flip-flops in 
the design as a percentage of the 


(* Note: Bubbled inputs on gates 
were counted as .5 gates unless all 
inputs were bubbled which counted 
for none.) 









Combinatorial 


FF Gates Gates Total Gates 



















Design 1 24 144 116 260 
Design 2 143 672 846 1518 
Design 3 171 835 461 1296 
Design 4 162 962 572.5 1534.5 
Design 5 42 252 628.5 880.5 
Design 6 22 132 131 263 
Design 7 127 714 622 1336 
Design 8 178 1103 486.5 1589.5 
Design 9 127 740 833.5 1573.5 
Design 10 84 356 412 768 
Average 





Table 2. Actel Design Gate Counts 





Combinatorlal 
FF Gates Gates Total Gates 


Design 1 } 228.00 186.50 414.50 
Design 2 Y 144.00 123.00 267.00 





Design 3 + 448.00 372.00 820.00 
Design 4 ; 192.00 149.00 341.00 
Design 5 . 594.00 319.00 913.00 
Design 6 А 1136.00 399.50 1535.50 
Ахегаре 





Table 3. Xilinx Design Gate Counts 





of flip-flops to CLBs ranges from 
zero to two) it was expected that 
designs would achieve densities in 
proportion to the number of flip- 
flops used. As may be seen in Graph 
2, there is a strong (98%) correlation 
between the fraction of the total 
gates in the design that are used to 
implement flip-flops and the number 
of gates realized per CLB. The aver- 
age number of gates per CLB for the 
designs analyzed was 8.5. Table 4 
shows the number of gates for each 
Xilinx device at 8.5 gates per CLB. It 
also includes CLB utilizations that 
are typical according to the Xilinx 
design guide. 
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Table 5 shows the gate counts for 
the Altera-generated benchmarks. 
The average was 1559 gates. Graph 3 
plots the total number of gates in the 
benchmark against the fraction of 
gates used as flip-flops. Another 
data point provided by Altera comes 
from the data sheet for the EPM5128, 
where it claims that an equivalent to 
a 74161 counter occupies 3% of the 
device. The counter requires 54 gates 
to implement, which would put the 
capacity of the device at 1800 gates. 
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Architectures Determine 
Actual Device Density 
Because there are almost as many 
ways to count gates as there are 
manufacturers, had additional devices 
been analyzed, alternate gate count- 
ing methods would have been de- 
scribed. With these test results as a 
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guide, the analysis techniques and 
formulas that follow can be used by 
designers to determine how their 
application-specific design will fit on 
the various devices available, regard- 
less of density claims. 

PLDs will be considered first, to 
sce how much logic they can accom- 
modate and what types of designs fit 
best оп that architecture. In general, 
thetypical PLD architecture contains 
a huge number of combinatorial 
functions and only relatively few flip- 
flops. The limiting factor with this 
technology is the number of flip- 
flops. 

Designers must first analyze their 
own designs to determine the ratio 
of combinatorial two-input gates to 
flip-flop gates, keeping in mind that 
a flip-flop function represents six 
two-input gates. For simplicity, treat 
a three-input gate as 1% two-input 
gates, and a four-input gate as two 
gates. Next, the following formula 
can be used to determine if the 
intended PLD will accommodate 
the design. The equation accurately 
determines the density of any PLD 
device and is more precise than gate 
count claims because it accounts for 
flip-flop limitations of PLDs. 








Typical 

Gates Total Device Usable Gates IOB Total Advertised 
Device CLBs  perCLB Gates Utilization Gates IOB perlOB Gates Gates Gates 
XC3020 64 8.5 544 0.85 462 64 3.25 208 670 2000 
XC3030 100 8.5 850 0.8 680 80 3.25 260 940 3000 
XC3042 144 8.5 1224 0.75 918 96 3.25 312 1230 4200 
XC3064 224 8.5 1904 0.7 1333 120 3.25 390 1723 6400 
XC3090 320 8.5 2720 0.65 1768 144 3.25 468 2236 9000 








Fraction 
FF Gates 


Designs per 
Device 


Gates per 
Device 


Combinatorial 


FF Gates Gates Total Gates 


Arithmetic 245 293 0.16 

Timer Counter 143 287 0.50 

State Machine 67 139 0.52 1 

Data Path 73 169 0.57 9 
Average 








Table 5. Altera Benchmark Gate Counts for EPM5128 
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Gt = ))6 gates x #FF)/PercentFF))» 
(Df), where: 


Gt = upper gate limit on the PLD 
#FF = the total number of flip- 
flops in the PLD 

PercentFF = the fraction of gates 
that reside in flip-flops of the 
actual circuit 

Df=aderating factor, or maximum 
device utilization factor available 
on manufacturers’ data sheets. 
For PLDs, Df is equal to approx- 
imately 0.9. 


The term, PercentFF, is a useful 
way of distinguishing among designs 
and is relatively easy to estimate 
based on the number of flip-flops in 
a design and the approximate total 
gates. Independent research conduc- 
ted by Stanford University on scores 
of logic designs indicates that ap- 
proximately 45% of all gates used in 
logic designs are used to implement 
flip-flops and the remainder is used 
for combinatorial logic. According 
to the LSI Logic handbook, six two- 
input gates are required to imple- 
ment a flip-flop. Thus, designers can 
use 45% as a representative number 
if knowledge about their own designs 
is absent. Thus, using the equation 
and this 45% figure, the upper bound 
on the capacity of a PLD containing 
128 flip-flops may be found to be: 


Gt = ((6x128)/(.45))(0.9) = 
1700(0.9) = 1530 gates. 


Thus, a PLD design that uses all 
the flip-flops available could expect 
to achieve densities of no greater 
than 1700 gates (Df = 1.0), regardless 
of what the manufacturer states when 
PercentFF is 0.45. For values of 
PercentFF between 0.35 and 0.55, 
Gt ranges from 1975 to 1250. This 
number is about a third the density 
claimed for such devices by some 
PLD vendors and consistent with 
the actual test results detailed in 
Table 5. While it might be possible to 
find a design that uses the fixed PLD 
architecture more effectively by cre- 
ating flip-flops from combinatorial 
logic, such circuits are generally much 


slower than built-in flip-flops and 
are not generally usable. 

The FPGA architectures can be 
evaluated using similar equations to 
determine capacities. As indicated, 
the two types of FPGA architecture 
available are the LCA and ACT 1 
family of channeled gate arrays. 

LCAs that employ dedicated flip- 
flops have a better balance between 
combinatorial logic and flip-flops 
than PLDs, but for most designs, 
there are more flip-flops than neces- 
sary and combinatorial logic resourc- 
es, thus, become the limiting factor 
in logic density instead of flip-flops 
as with PLDs. 

The designer can determine how 
well his circuit will fit on an LCA 
device by examining his schematics 
to see how many inputs and how 
many gates were used on the logic 
feeding a flip-flop. For example, if 
there is a four-input block, how 
many gates are typically feeding that 
block : 3, 4, 5? In the selection of 
vendor-generated TTL macros ana- 
lyzed in the previous section, there is 
an average of 3.6 combinatorial gates 
per CLB. If four or fewer inputs 
were used most of the time, then the 
average number of gates prior to flip- 
flops can be used in the following 
formula to find the upper bound for 
the gate utilization on an LCA (if 
you need more than four, say, five, 
you get fewer gates and you need 
two blocks): 


Gt = (Aff x #CLB x 
#pre-FF gates)/(1-PercentFF)) 
(Df), where: 


Gt = upper gate limit on the LCA 
#ff = number of flip-flops per CLB 
#CLB = number of logic blocks in 
the device 

#pre-FF gates = number of two- 
input gates before the flip-flop 
(e.g. 3.6) 

(1- PercentFF) = fraction of gates 
in the user’s combinatorial logic 

Df = aderating factor, or maximum 
device utilization factor available 
on manufacturers’ data sheets. 
For LCAs, Df ranges between .55 
to .85. 





(Note: The formula ignores gates in 
I/O blocks. Flip-flops in I/ O general- 
ly improve performance, and effec- 
tively increase routing resources 
rather than gate count because there 
are no combinatorial functions in 
the I/O for use with those flip-flops. 
For designs with PercentFF in the 
range 0.35 to 0.55, LCAs are usually 
combinatorial gate limited.) 

Using this formula on an LCA 
with 144 CLBs and assuming an 
average of three pre-FFgates, 45% 
of the gates reside in the flip-flops, 
and a utilization of 7595, a sample 
calculation would be as follows: 


Gt = ((2x144x3)/.55).75 = 0 
gates. 


Thus, an LCA with 144 CLBs and a 
claimed gate density of 4200 gates 
has a maximum 1180 usable gates, 
and only 1570 if fully utilized (Df = 
1.0, under the above conditions). 
The results of this equation show 
results somewhat higher than found 
in the actual design of Table 4. 


Similar capacities can be calcu- 
lated with ACT I devices. The ACTI 
architecture uses a large number of 
relatively small building blocks and 
has no dedicated flip-flops. There- 
fore, neither the PLD nor LCA 
formulas detailed above can be used. 
The ACT 1 architecture isn't con- 
strained either by flip-flops or com- 
binatorial logic. However, certain 
assumptions can be made and a 
formula derived that will also give 
designers consistently accurate ca- 
pacity calculations. As in the previ- 
ous cases, assume 45% of all gates 
reside in device flip-flops. Again, 
other values may be used based on 
actual design characteristics. 

Consider, forexample, the ACT1020, 
a 2000-gate FPGA. This device in- 
corporates a total of 546 logic mod- 
ules and two are used to implement 
one flip-flop. Most gates, multi- 
plexors and latches use one module. 
With these architectural considera- 
tions in mind, the following formulas 
can be used to give designers a fairly 
accurate capacity reading on an 
ACT 1 device: 





#LM = #MF + #MC Eq. 1 


#MFx3.4/(#MFx3.4+#MCx3.5) 
= 45 Eq. 2 


Gt = (MFx3.4 + #MCx3.5XDf) 
Eq. 3 


where: 
#LM = total number of logic 


Two logic modules can be used to 
implement a flip-flop and some 
combinatorial logic. The sequential 
modules in the designs in the previ- 
ous section attained 3.4 gates/ 
module. 

Therefore, the 2000-gate ACT 1 
device with 546 modules: 


Solving Eq. 2 yields: 
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Evaluating Development 
Tools 

While the equations enable de- 
signers to evaluate architectures, a 
hands-on approach is often preferred 
by engineers. And although device 
manufacturers supply development 
systems that allow designers to 
analyze the design and program the 
devices here too, as with technology 


modules on an Actel device #MF = .84#МС. and density comparisons, care must 
#MF = number of modules used Substituting this result into Eq. 1 be taken to achieve accurate results. 
for flip-flops gives: For example, when doing a competi- 
#MC = number of modules used 546 = 1.84#MC tive analysis of development systems, 


for combinatorial functions 
Gt = upper gate limit on the LCA 
זכ‎ = the module utilization factor. 


Equation I states that the sum of 
the combinatorial and sequential 
modules equals the total number of 
modules. 

Equation 2 states that the assump- 
tion that 45% of the gates in an 
average design are used as flip-flops. 

Equation 3 computes the expected 
number of gates used in an ACT 1 
device. 

The logic module can implement 
one to five combinatorial gates with 
3.5 being typical. Df is listed by 
Actel as ranging from .85 to .95, so 
.90 will be used for these calculations. 


#MC = 296 and #MF = 546-296 = 
250. 

Finally: 
Gt = (296x3.5 + 250x3.4)(.9) = 
(1886)(.9) = 1697 gates. 


It should be noted that the results 
of this equation are also consistent 
with the tested gate counts deter- 
mined for Actel devices in Table 2 
when the 3.4 gates/module utiliza- 
tion (approx.) factor is used. 

Using these formulas for PLDs, 
the results indicated in Table 6 were 
calculated for a representative num- 
ber of devices from eight different 
manufacturers. 


beware of trying to save time by 
doing an identical small design on 
each. A small design is one that uses 
only a small fraction of the chip. To 
see what the system and the device 
can really do, it is important to test 
them to their limits. A simple 
example may make device develop- 
ment and programming tools appear 
to be stronger than they actually are. 

Most systems vary a great deal in 
cost and capability. Without becom- 
ing familiar with a system it is 
difficult to understand its capabil- 
ities. Some of the most important 
features to examine are the place 
and route software, timing analysis 
tools and platform support. 

The P&R software is the heart of 














Dedicated Fraction 

Vendor Device Flip-Flop? Utilized! No Gates* No I/O Reprogrammable 
Actel 1010 0 .90 916 57 N 
1020 0 .90 1697 69 N 
Altera 5064 64 .90 770 36 Y 
5128 128 .90 1530 60 Y 
AMD 16R8 8 1.0 105 16 N 
22V10 10 1.0 135 22 N 
Atmel 2500 48 1.0 640 38 Y 
ICT 7024 40 1.0 530 22 Y 
7040 48 1.0 640 30 Y 
Plus Logic 2020 144 9 1255 72 Y? 
Signetics 2552 52 1.0 575 53 үз 
Xilinx 3020 128 9 630 64 Y 
Y 
Y 
Y 
Y 





Table 6. Usable Gates 
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the development system. The soft- 
ware assigns the logical resources of 
the device to implement logic func- 
tions described by the design netlist. 
It uses the routing resources (i.e. 
tracks and interconnect elements) to 
connect the logic functions together. 
The extent to which the software 
succeeds will determine the logic 
capacity of the device and, to a large 
extent, its performance. All FPGA/ 
PLD vendors include P&R software 
with their systems, but the software's 
ability to implement designs is con- 
strained by the power of its algor- 
ithms, the architecture of the device, 
and the amount of interconnect 
resources available. 

Place and route on a PLD is 
simpler and faster than on an FPGA 
due to the less flexible architecture 
of PLD devices. PLDs usually place 
and route successfully unless the 
design exceeds the resources of the 
device. 

The FPGA architecture allows 
for great P&R flexibility, but requires 
more of the software because there is 
more to do. The router may fail to 
complete successfully requiring that 
design be changed or, if allowed, 
routed by hand. The designer has 
much more control over FPGA im- 
plementations than PLDs by being 
able to assign placement directly or 
attaching criticality to nets in the 
design. 

FPGAs require that less of the 
device area be devoted to wiring 
resources than PLDs. As devices 
scale to higher levels of integration, 
PLDs will require a larger propor- 
tion of the device for wiring than 
FPGAs. For this reason, FPGAs 
will be better able to sustain larger 
device densities. 

The speed with which the router 
operates is also important. If each 
design iteration requires many hours 
to run the P&R tool, then it signi- 
ficantly impacts the design cycle 


Notes: 


time. PLD routing times are short as 
there is relatively little for the soft- 
ware to do. Routing times for 
FPGAs may range from minutes to 
hours depending on the routing re- 
sources available, the sophistication 
of the software and the complexity 
of the design. The problem is exacer- 
bated if the software runs for a long 
time and still fails to route some 
nets. At that point the designer may 
be forced to P&R the entire design 
manually. 

Manual P&R is tedious and time 
consuming. When manual interven- 
tion is required, every design iteration 
requires the same amount of time as 
the first and the development time is 
multiplied by the number of itera- 
tions, While manual P&R allows the 
designer complete control over the 
design implementation, it requires 
him to have detailed knowledge of 
the physical architecture of the device. 
Nevertheless, FPGA architectures 
and tools are available that permit 
fully automatic placement and rout- 
ing in under 30 minutes. 

Designers have little control over 
the delays from the P&R in PLDs. 
Moreover, the fixed path delays limit 
any opportunities for timing optimiza- 
tion. PLD delays do not vary with 
load, but are determined by the 
individual delays of the device ele- 
ments that make up the path. There's 
more variation in FPGA delays than 
in PLDs and the delays are load 
dependent. Architectural flexibility 
allows for delay optimization which 
may be done manually or by using 
software tools. 

The timing analysis tools available 
with the various systems vary con- 
siderably in both the quality and 
quantity of the information they 
provide about the system. The timing 
analysis tools that come with the 
development system should allow 
the designer to examine timing 
details of the design. If there is a 





timing problem, the timing analyzer 
should be specific enough to allow 
the problem to be pinpointed so the 
design or the P&R may be modified. 
The more commands available and 
the more powerful the commands 
are the faster the designer can analyze 
the design. 

If the vendor-specific software 
supports other CAE systems, there 
should be a way to back-annotate 
the delay information to the CAE 
system simulator. Back-annotation 
allows the design timing information 
to be seen graphically and under 
simulated operating conditions. 

Finally, designers need to consider 
platform support. Designers who 
already have invested in hardware for 
a CAE system may also want to use it 
to develop programmable devices. 
Designers need to make sure the 
vendor's interface package for his 
workstation is complete. For exam- 
ple, does it have back-annotation 
available for effective use of the 
simulator. 


The End of the Line 

To be sure, PLDs have changed 
dramatically. Fortunately, with a 
little common sense and the aid of 
the guidelines and formulas detailed 
here, designers can wade through 
the hype and mounds of vendor 
literatureto determinethe best FPLD 
for the application. It's important to 
keep in mind that all devices are 
different and can suit a variety of 
applications. Successfully choosing 
one category of device for Design A 
doesn't mean it will be the right 
device for Design B. Every applica- 
tion requires a recalculation and 
close scrutiny of the circuit. By 
closely examining the device tech- 
nologies, development tools and, 
above all, gate capacities, the op- 
timum device can easily be selected. 


1. Based on design experience. Smaller devices can usually use all available flip-flops. Larger devices cannot always take advantage of 


embedded flip-flops. 


2. Acteldevices contain no dedicated flip-flops. Any logic module may be used as a latch and any two modules may be used as a flip-flop. 
3. These devices contain input flip-flops which cannot be driven by internal combinatorial logic. The input flip-flops are counted only 


for the gates they contain. 


4. All gate counts assume 45% of users' gates are used for flip-flops. 
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Abstract — An architecture for electrically configurable gate arrays using 
a two-terminal anti-fuse element is described. The architecture is extensi- 
ble, and can provide a level of integration comparable to mask-programma- 
ble gate arrays. This is accomplished by using a conventional gate array 
organization with rows of logic modules separated by wiring channels. 
Each channel contains segmented wiring tracks. The overhead needed to 
program the anti-fuses is minimized by an addressing scheme that utilizes 
the wiring segments, pass transistors between adjacent segments, shared 
control lines, and serial addressing circuitry at the periphery of the array. 
This circuitry can also be used to test the device prior to programming and 
observe internal nodes after programming. By providing sufficient wiring 
tracks segmented into carefully chosen lengths and a logic module with a 
high degree of symmetry, fully automated placement and routing is facili- 
tated. 


I. INTRODUCTION 


ASK-programmable gate arrays offer the architec- 

tural flexibility and efficiency to integrate thou- 
sands of gates, but require long development time and high 
nonrecurring engineering costs. On the other hand, the 
convenience of field programming is available with pro- 
grammable logic device (PLD) technologies, but their ar- 
chitectures have not allowed integration of a wide variety 
of applications exceeding a few hundred gates [1], [2]. 

We describe a novel gate array architecture [3] which 
combines the flexibility of mask-programmable arrays with 
the convenience of field programmability. Its implementa- 
tion is made possible by a two-terminal electrically pro- 
grammable anti-fuse offering low resistance in its conduct- 
ing state and small area. 

The architecture supports a design style similar to con- 
ventional gate arrays, including fully automatic placement 
and routing algorithms attaining 85-95-percent utilization. 
This required considerable emphasis on symmetry and 
routability, which we touch on below. 

The anti-fuse is so called because it irreversibly changes 
from high to low resistance when “blown” by applying a 
programming voltage across it. The anti-fuse, or fuse for 
short, has an ON-state resistance of approximately 500 Q. 
The layout area of the fuse cell is generally limited by the 
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pitch of the first- and second-level metal lines that connect 
to it; it is about the same size as a via. 

This paper focuses on the architecture itself, which is 
fairly independent of the exact details of the particular 
CMOS technology and the anti-fuse. Other papers describe 
more fully the anti-fuse [4], a CMOS circuit implementing 
the architecture [5], and a study comparing the architec- 
ture's logic density to that of conventional gate arrays [6]. 


П. PROGRAMMABLE INTERCONNECT ARCHITECTURE 


The general architecture, shown in Fig. 1, exhibits the 
familiar gate array organization: rows of logic cells inter- 
spersed with routing channels. There are, of course, several 
key differences. 

The tracks in the channels are not simply empty areas in 
which metal lines can be arranged for a specific design. 
Rather, they contain predefined wiring “segments” of vari- 
ous lengths. Other wiring segments pass through the chan- 
nels vertically. Each input and output of a logic module is 
connected to a dedicated vertical segment. Other vertical 
segments just pass through the modules, serving as 
feedthroughs between channels. (The number and lengths 
of segments in Fig. 1 are only suggestive.) 
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Fig. 2. Horizontal fuse programming. 
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Fig. 3. Cross-fuse programming. 


A fuse is located at each crossing of a horizontal and 
vertical segment. Programming one of these “cross fuses” 
provides a low-resistance bidirectional connection between 
the segments. Other fuses are located between adjacent 
horizontal segments within a track. When blown, these 
“horizontal fuses” connect the two segments to form a 
longer one. (Although not shown in the diagram, fuses 
may also be provided to connect adjacent vertical seg- 
ments.) 

In order to program a fuse, we need to apply high 
voltage across it. This is accomplished by an efficient 
addressing scheme that uses the wiring segments them- 
selves, pass transistors connecting adjacent segments, and 
control logic at the periphery of the array. Fuse addresses 
are shifted into the chip serially. 

As shown in Fig. 1, each column of “horizontal pass 
transistors” connecting horizontal tracks is controlled by a 
shared “horizontal control” line running across the array. 
Each row of “ vertical pass transistors” is controlled by a 
“vertical control” line. The peripheral circuitry can drive 
the control lines and the segments at the end of each track. 

Horizontal fuse programming is quite simple. In the 
example of Fig. 2, we apply programming voltage V,, 
across the fuse F,. All horizontal control lines except the 
one in the column containing F, are turned on by connect- 
ing them to Ирр, and the appropriate track segments are 
driven to GND and Ирр as shown. (Vertical fuses, if 
present, are programmed similarly.) Cross-fuse program- 
ming uses both horizontal and vertical control lines as 
shown in Fig. 3. Segments not driven to either GND or Vpp 
are left precharged to Ирр /2. Thus the voltage across fuses 
not being programmed is either zero or Vp, /2. 

Some care is required to assure that a unique fuse is 
addressed. Fig. 4 shows how previously blown fuses can 
divert current along a “sneak path,” in this case program- 
ming fuse F, through previously blown fuses F, and F, 
instead of programming F,. Fortunately, we are not inter- 
ested in blowing an arbitrary pattern of fuses (this is not a 
PROM!). For example, we are not concerned with pro- 
gramming a pattern that connects two outputs together 
since this does not form a useful net. If we consider only 
relevant patterns, it can be shown that programming the 
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Fig. 4. A sneak path. 


TABLE I 


3 input NOR 
4:1 mux, non-inverting 
D latch with clear 


D flip-flop with clear/set 
full adder 





fuses in a carefully chosen order eliminates sneak paths. In 
general, fuses must be programmed starting from the cen- 
ter of the chip and moving outward, channel by channel. 
Determining the proper order is a bin sort operation, and 
can be done by software in linear time. 

The pass transistors and peripheral control logic are also 
used to test the chip; this is discussed in detail later. 


III. CHOICE OF THE LOGIC MODULE 


As outlined so far, the programmable interconnection 
architecture could be used with a variety of logic modules. 
Which would be best? This turned out to be a very 
difficult question, involving subtle trade-offs among 
routability, the logical capability of the module as per- 
ceived by the user, and delays due to capacitive loading in 
the routing segments. 

The complexity of the module must be balanced with 
the routing overhead. Mask-programmed gate arrays pro- 
vide very flexible and efficient routing. They therefore use 
a simple four-transistor cell. On the other hand, routing is 
very expensive in both area and delay with present pro- 
grammable logic arrays. These generally use a module 
capable of implementing more complex functions [2]. The 
architecture outlined here has a cost of routing closer to a 
conventional gate array, suggesting a logic module of inter- 
mediate size. Because this is about the same complexity as 
conventional gate array hard macros, the designer can use 
a library like the familiar gate array cell libraries; there is 
no need to map logic into a more complex module. Table I 
lists several typical gate array macros and the numbers of 
four-transistor cells and logic modules required to imple- 
ment them. 
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Fig. 5. Module function. 


Our chosen module, shown in Fig. 5, has eight inputs 
and a single output. It is composed of three two-to-one 
multiplexors, with an OR gate on the last stage’s select 
input. Various macros, such as those in the table, are 
implemented by using an appropriate subset of the inputs 
and tying the remaining inputs high or low. Thus the 
module can implement all macros with two inputs, most 
with three inputs, many with four inputs, etc. 

The module’s output is connected to a vertical segment 
spanning several channels. Each input is connected to a 
short vertical segment spanning one channel. Four of these 
span the channel above the module, four the channel 
below. The use of short segments for the inputs reduces 
parasitic capacitance and hence delay. 

Note that each input is accessible from either the chan- 
nel above or below but not both. At first, this would 
appear to limit routability compared to a conventional 
“double-entry” gate array cell, in which signals may enter 
from either channel. However, there is nearly always more 
than one way to implement a macro. For example, there 
may be up to four distinct ways to implement a two-input 
gate: with both signals connecting to inputs in the top 
channel, with both signals connecting to inputs in the 
bottom channel, with one signal in the top and the other in 
the bottom channel, or vice-versa. 

By letting the router choose an implementation that uses 
inputs accessed from convenient channels, the benefits of 
full double-entry symmetry are approached or sometimes 
attained. The degree of symmetry possible for a particular 
macro m implemented in a given module is reflected in the 
following measure S: 


S(m) = log; (N(m)) 


where N(m) is the number of distinct possible implemen- 
tations of the macro m. Full double-entry symmetry would 
correspond to a value S(m) equal to the fan-in of the 
macro. To evaluate the overall symmetry of a module, we 
average S(m) over the macro library, weighted by relative 
macro usage U(m) and the fan-in F(m): 


Y U(m)S(m) 
LU(m) F(m) 
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Fig. 6. А routing path. 


This is the effective fraction of macro inputs in a typical 
design that have double-entry symmetry, and is an impor- 
tant criterion for choosing a module. 


IV. ROUTING 


Fig. 6 illustrates the routing of a net. The vertical 
segment connected to the driving module’s output is con- 
nected by cross fuses to horizontal segments, which in turn 
connect to the segments associated with module inputs. In 
the top channel, a horizontal fuse is used to link two 
segments into a longer one. 

The resistance of the blown fuses and the parasitic 
capacitance of the segments used form an RC tree, with 
the driver of the net as the root. Note that each input is 
driven through a maximum of three and generally two 
fuses to limit the delay. (If the number of series stages in 
the RC tree were allowed to increase further, the delay 
through the routing would increase rapidly.) The maxi- 
mum number of fuses and the segment lengths (hence 
capacitances) can be altered to suit the chip dimensions 
and the resistance of the fuse technology. 

In rare instances, it is not possible to place the macros 
so that all inputs on the net lie within the channels 
spanned by the output segment of the driving module. To 
handle this case, a few additional uncommitted long verti- 
cal segments are provided. The net is then routed from the 
output segment to a horizontal segment, then to the long 
vertical segment, then to another horizontal segment, and 
finally to the necessary input segments. To keep the num- 
ber of fuses in series limited to four, no horizontal fuses 
are allowed in such nets. (If necessary, the architecture can 
be extended to provide a special fuse connecting the out- 
put directly to a long vertical segment passing over the 
driving module, thus eliminating the first horizontal seg- 
ment and reducing the total number of fuses in series back 
to three.) 

A means must also be provided to connect internal 
signals to the bonding pads of the chip. Each pad has a 
dedicated bidirectional buffer, which connects to the array 
through an associated 1/O module. The 1/O modules fit 
in the outer columns and rows of the array next to the 
logic modules. Each 1/O module has two inputs, data and 
enable, and an output. The data and enable signals are 
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sent to the output buffer of the associated bonding pad, 
and the module’s output comes from the input buffer of 
the pad. Thus the I/O module can be configured to 
provide input, output, tristate, or bidirectional capability. 

To minimize clock skew due to differential routing de- 
lay, one entire track (or more if needed) in each channel is 
set aside for clock distribution. These tracks are connected 
directly to buffers, so that each input presents a similar 
load driven through exactly one fuse. 

An interesting theoretical question is whether more hori- 
zontal tracks are needed in each channel here (where the 
lengths of the wiring segments must be predetermined) 
than in mask-programmed routing (where the wiring is 
customized for the design). Surprisingly, a high probability 
of routability is obtained with only a few tracks above 
channel density. 

This requires a careful choice of the lengths of the 
segments, based on statistics from an extensive suite of 
design examples. This was done by first determining the 
distribution of net lengths, i.e., the length each net would 
run along each channel if the constraint of fixed segmenta- 
tion were absent (as in a conventional gate array). The 
distribution of physical segment lengths provided on the 
chip was chosen to obey similar statistics. Then the seg- 
mentation was “tuned” manually based on actual routings 
which obeyed the constraints it imposed. 

To obtain good routing performance it is also necessary 
to take advantage of the symmetry of the macros where 
possible. For example, observe that if macro 4 in Fig. 6 
permits its input to be routed from either the upper or 
lower channel, there is a better chance of finding a free 
horizontal segment to connect it. 


V. TESTING 


To assure high programming yield, it is necessary to 
thoroughly test the chip for defects in the modules and 
fuses prior to programming. With a simple addition, the 
addressing circuitry used for programming suffices for this 
purpose as well. 

Continuity of the tracks is easily verified by turning on 
all vertical and horizontal pass transistors, and using the 
peripheral circuits to drive the tracks from one end and 
tead them back from the other. Testing for the absence of 
shorts between adjacent tracks is done in a similar way by 
applying a pattern of alternating ZERO’s and ONE’s. 

Shorted or weak cross fuses are detected by turning on 
all horizontal and vertical pass-transistor lines, grounding 
all horizontal segments, and driving all vertical segments 
to some stress voltage. Horizontal fuses are tested column 
by column, with the same addressing method that is used 
to program them. 

To verify the functional operation of the modules, we 
need to apply test vectors to their inputs and read their 
outputs. A vector is applied simultaneously to an entire 
row of modules by turning on all vertical pass transistors 
except those in the row being tested. Data are applied to 
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Fig. 7. Probe circuit. 


the inputs in the channel above the row from the periphery 
at the top of the array, and to the inputs in the channel 
below the row from the bottom of the array. 

Since the outputs of the modules share a vertical track 
with outputs of other modules above and below them, 
some other means is required to read the module outputs 
at the array periphery. As shown in Fig. 7, a select line is 
provided along each row of modules, and a sense line 
along each column. Activating the select line for the row of 
modules under test gates their output values onto the sense 
lines. The sense lines are loaded into a shift register at the 
top of the array. 

This ability to read the output of any module at the 
array periphery is highly useful after programming as well. 
Only a small amount of extra circuitry is required to select 
one of the sense lines and make its value available at an 
external pin of the chip. Thus by shifting in the appropri- 
ate address, the user can observe any internal node of his 
design externally in real time. This virtual probe can be 
used and its address changed even as the programmed chip 
is operating in the user’s system. 


VI. IMPLEMENTATION: SILICON AND SOFTWARE 


The architecture has been implemented in a CMOS 
device. For details, including the speed of the module in 
isolation and in an application, see [5]. 

Computer-aided design tools have been developed to 
support the architecture. Designs are entered as schematics 
or net lists using a cell library. 

The placement and routing algorithms are specific to the 
architecture. As usual these are time consuming, taking up 
to a few hours on a low-cost workstation. They achieve 
100-percent routing completion. (Even expert users have 
never been able to improve manually on the automatic 
router.) The probability of successful routing can be pre- 
dicted by analyzing some statistics of the design. 

Because the nets are RC trees, delays are not a simple 
function of capacitive load as with mask-programmed gate 
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arrays. Nevertheless, we are able to quickly calculate pre- 
cise delays at each input for post-layout simulation and 
timing verification. 
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ABSTRACT 


This paper describes a Programmable Low 
Impedance Circuit Element (PLICE), which is a 
dielectric based antifuse for use: in both 
logic and memory ICs. The antifuse element 
offers significant size and perforsance 
{mprovement compared to other prograsmable 
cells. A simple thermal model has been 
developed to predict the antifuse resistance, 
Each antifuse occupies an area of 15 um 
using L2um technology. It can be programmed 
within 1 msec, and has a tight resistance 
distribution centered around 500 ohms. The 
reliability of both the programmed and 
unprogrammed states is demonstrated to be 
better than 40 years. The antifuse was used 
in the design of the first family of desktop- 
configurable channeled gate arrays and a 64K 
PROM device. 


INTRODUCTION 


PROMs and programmable logic devices commonly 
employ programmable elements such as fusible 
links, EPROM or EEPRON cells. These 
programmable elements are either large in 
area, require high programming current, or are 
three terminal devices. This paper describes a 
Programmable Low Impedance Circuit Element 
@LICE), which is a dielectric based antifuse 
that offers significant size and perforsance 
improvements over other programmable 
elements. The antifuse structure, technology, 
characteristics, thermal model and reliability 
are described below. 


ANTIFUSE STRUCTURE AND TECHNOLOGY 


The PLICE antifuse is a dieleotric between an 
n* diffusion and poly-Si as shown in the SEM 
cross-section (Fig. (D and is cospatible with 
CHOS and other technologies such as bipolar 
and BIMOS. The PLICE element was integrated 
in a standard 12 mask, double layer metal twin 
tub CMOS technology. Three additional masks 
were required; n* antifuse diffusion, antifuse 
poly, and 40 na oxide mask for the high 
voltage transistors, bringing the total to 15 
masks. Four transistor types (TABLE D are 
used; the low voltage high speed transistors 
are used for the logíc and signal path while 
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the high voltage transistors are used for the 
programming path. Fig. (2 is a photomicrograph 
of a 2000 yate programmable gate array 
utilizing 186,000 antifuses. There are roughly 
one hundred antifuses for each logic gate to 
achieve a high degree of interconnectivity and 
gate utilization. Each antifuse occupies an 
area equivalent to a contact or via. The 
antifuses are incorporated into a cell 
structure such that when the antifuse is 
prograssed, it connects a metal 1 and metal 2 
line (1,21. The cell structure size is thus 
limited by the metal 1 and metal 2 pitch 
Fig. (9 is a photomicrograph of a 64% PROM 
designed using the same technology with a 
typical access tise of 35 neec. 


ANTIFUSE CHARACTERISTICS 


When 16 volts is applied across the antifuse 
through X-Y select transistors (1,21, the 
antifuse is programmed to the comduotive 
state in about 200 usec as shown in Fig. (42, 
Fig. (5) shows a tight resistance distribution 
of antifuses programmed with a current of 
5 mA. 


ANTIFUSE THERMAL MODEL 


Once the dielectric is ruptured, the 
resistance R, of the conductive state is 
determined by the size of the conductive 
conduit (link. The size of the link is 
determined by the amount of power dissipated 
in the link whioh melts the dielectric. Since 
the temperature of the molten core varies 
inversely vith its radius, the molten core vili 
expand until its temperature drops below the 
melting point of the dielectric. Since the size 
of the link is much smaller than the thickness 
of the conductive silicon layers on both sides 
of the dielectric, the temperature gradient 
and the resultant heat oomduction can be 
modelled by a simple sphere. The resuitant 
equation for the link temperature Ty 
Tyr LV Ankur א א‎ S | 

Where I, is the programming current, V, is the 
voltage across the link, гу is the radius of 
the link and k, is the thermal oonductivity 
of silicon, The calculation however becomes 
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complioated if the thermal conductivity of 
silioon is taken into consideration as a 
function of its temperature [3] Furthermore, 
the power dissipation is distributed 
throughout the sphere. Hence a computer 
simulation that breaks the sphere into 1 nm 
thick conductive shells was used to calculate 
the link temperature T, and resistance R, vs 
programming current I The resultant 
caloulations are plotted Mm Fig. (0. The link 
radius ןת‎ is determined by the equilibrium of 
power dissipation and melting temperature, as 
expressed in the following equation [41 


rh VT + /2:/67 6 .2 


where f, Sa are the resistivities of the link 
and silicon respectively during programming 
and the relation between link resistance R 
and link radius гү can be obtained ав 
#оПомвг4): 


RODA, ..............(3) 


From eqn(2) and eqn(3) we derive that R) is 
thus inversely proportional to I, (Fig. (68), 
given by the following equation 


R= 67/1 ж tL Tr yz ..‹4) 
This can be further staplified to: 


)5(............... .= תא 
At 5 mA, for example, the link radius is about‏ 
nm and the fuse resistance is 500 ohms -‏ 20 
(approximately the impedance of а ZOum wide‏ 
EPROM cell) in its conductive state. The‏ 
programmed antifuse resistance is a function‏ 
of the reading current as shown in Fig (7).‏ 
The resistance rises at a current just before‏ 
the programming current, due to heating of‏ 
the conductive channel; drops when the silicon‏ 
in the conductive channe! melts; and continues‏ 
to decrease beyond the original resistance as‏ 
the current exceeds the programming current.‏ 
This is due to the permanent widening of the‏ 
conductive link.‏ 


ANTIFUSE RELIABILITY 


Programmed antifuse reliability was evaluated 
using discrete antifuses incorporated in a 
Kelvin structure (Fig. (8). The structures 
were stressed with a 5 mA current between 
two of the terminals 1, 2 at a temperature of 
250C. Fig. (9) is a plot of the monitored 
voltage across the programmed antifuse as a 
function of stress time. Failure is indicated 
by an increase in voltage as the structure 
becomes open Measurement of the antifuse 
through the other two terminals 3, 4 indicated 
that the programmed antifuse did not fail or 
exhibit any measurable resistance change. 
Investigation of the failure using SEM 
indicated that the failure is due to poly 
contact electromigration. With 0.9 ev 
activation energy [5] for contacts, the 
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predicted lifetime is more than 40 years. 
Fig. GO) is a plot of time to breakdown vs 
Velectric-field for unprogrammed discrete 
antifuses. Extrapolation of Fig. (0) indicates 
that the lifetime of an unprogrammed antifuse 
under continuous 5.5v stress exceeds 
100 years, with a failure rate less than 
1 FIT. In addition to the accelerated discrete 
antifuse device reliability data, 964 product 
units have accusulated a total of 571,000 
device hours of dynamic burn-in at 5.5 volt 
and 125C (with some units reaching 2500 hours) 
vith no failures or change in a.c. 
characteristics. This is equivalent to a 
failure rate of less than 100 FIT. 


SUMMARY 


In summary we have presented and modelled a 
reliable high performance dielectric based 
antifuse which can be Programmed with 
relatively low current and is cospatible with 
CMOS and other technologies. The cell read 
current, 2 mA at 1 volt, is an order of 
magnitude higher than that of an EPROM of 
comparable size. There is no data (charge) 
retention concern, The unique combination of 
small size and lov resistance has enabled the 
development of the first family of desktop- 
configurable channeled gate arrays and a 8% 
PROM device. 
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TABLE 1 TR E SEGUE) 
TECHNOLOGY OVERVIEW запат AL пап 


PROCESS:TWIN ТИВ DOUBLE LAYER METAL |? 1 
CMOS 15 MASKS 5 1 

7 i 

I 


TRANSISTOR OXIDE LEFF 


NLOW VOLT 25nm 1.1 um 
PLOW VOLT 25nm 1.2um 
NHIGH VOLT 40nm 1.5 um 
P HIGH VOLT 40nm 1.8um 


ANTIFUSE CHRACTERISTICS: 


PROGRAMMING VOLTAGE 18 V 
PROGRAMMING TIME «1 mS 
PROGRAMMING CURRENT «< 10 тА 

ОМ RESISTANCE < 1K OHMS 
OFF RESISTANCE > 100M OHMS 





Fig.(3) Photomicrograph of 64K PROM 





Fig.(1) SEM cross-section of antifuse. 
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PROGRAMMING TIME (из) 


Fig.(4) Programming time of antifuse 
with 18V programming voltage. 





Fig.(2) Photomicrograph of 2000 gate 
programmable gate array. 
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Fum илиништи» (ohms) 


Programmed Antituse Resistance Histogram 
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Fig.(5) Programmed antifuse resistance 
histogram, programmed with 5 mA current. 
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Fig.(6) Programmed antifuse resistance 
versus programming current. 
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Fig.(7) Programmed antifuse resistance 
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Fig.(8) Antifuse Kelvin structure. 


ANTIFUSE ACCELERATED LIFETEST 
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Fig.(9) Voltage across antifuse versus 
stress time, with 5 mA stress current. 
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Fig.(10) Time to breakdown versus inverse 


electric field. 
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With the explosion 
of higher density 
PLDs and the ad- 
vent of even higher 
density field-pro- 
grammable gate ar- 
rays (FPGAs), the 


and issue of thorough 

Ken Hayes, 5 testing will 

\ come evermore 

Product Engineering, important. That's 
Manager because inad- 
Actel Corp. equate device test- 


ing prior to use 
places a severe 
burden on the system manufacturer. 
To try to compensate for inadequate 
factory testing, the system manufacturer 
can institute testing at incoming inspec- 
tion—a difficult task for complex LS] de- 
vices. If he doesn't, he must rely on board- 
level testing, or worse, system-level 
testing. And, inevitably, some faulty de- 
vices will escape to become field failures. It 
is well-known the cost of finding failures 
rises dramatically at each of these stages. 
Masked-programmed gate arrays are at- 
tractive for their low recurring costs, high 
gate counts and speeds. However, one 
little discussed disadvantage is that the 
quality of test coverage is the responsibility 
of the user. A thorough job of test-vector 
generation requires the use of fault grading 
or generation of automatic test vectors. 
Neither is very common in practice. Fault 
grading is generally expensive and adds 
complexity to the design Process. Auto- 
matic vector generation is simplified if a 
scan-design technique is adopted, but few 
engineers are willing to conform to the 
required design constraints. Automatic 
vector generation without scanning tech- 
niques is still a developing discipline. 
However, poorly tested devices are the 
likely result of inadequate fault coverage. It 
has been shown that typical masked gate 
arrays with 70 percent fault coverage are 
likely to have 0.1 to 0.2 defects per device. 


Many field-programmable devices largely 
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appointed users with functional defect rates 
of more than 10 percent for those parts that 

This has been improved substantially by 
the addition of on-chip test circuits to defect 
rates of 0.5 percent to 1 percent . While this 
Tepresents a low defect rate for individual 
chips, putting 10 such chips on a single board 
contributes to a 5 percent to 10 percent 
board-failure rate. Further improvement is 
difficult due to the fact that in conventional 
one-time programmable architectures, com- 
plete functional testing of all circuits in an 
unprogrammed part is not possible. In such 
fuse-based architectures in the unpro- 
grammed state, many circuits are shorted 


reprogrammable log- 
ic devices based on EPROM, Е'РКОМ and 
RAM programming elements has raised 
functional quality levels to nearly 100 per- 
cent. However, defect rates do not fall to 
zero because of post-testing failure mecha- 
nisms such as incorrect programming, 
marking, insertion and ESD damage. 


Non-volatile testing 
With at least one non-volatile, one-time pro- 


100 percent fault coverage. 

This testability is independent of the 
large number of equivalent gates (1,200 in 
the ACT 1010 and 2,000 in the ACT 1020). 
In order to describe how this is accom- 
plished, a few points about the ACT 1 
architecture need review. 

The basic building block on the chip is 
the logic module, which is programmable 
and capable of implementing all two-input 
logic functions, most three-input functions, 
and some four- and six-input functions. A 
single module implements a variety of 
latches, and two modules implement more 
than 32 types of D and J-K flip-flops. Each 
module is effectively the same complexity 
as a conventional gate-array macro. The 
ACT1020 contains 546 of these logic mod- 
ules. In order to achieve 85 percent to 95 
percent utilization of these numerous small 
building blocks, a large number of routing 
tracks and programming points are re- 


quired, comparable to those found in a 
conventional array. These are arranged in 
horizontal rows and vertical columns. 

Horizontal routing tracks alternate with 
the rows of logic modules. Vertical routing 
tracks run over the top of the logic modules 
and horizontal tracks. Around the outside 
of chip are VO circuits, test circuits, pro- 
gramming circuitry and a shift-register- 
based antifuse addressing scheme. 

The Actel-invented Plice antifuse was 
specifically developed for making logic 
interconnections. An antifuse is any 
programming element that is normally 
open and makes a connection upon the 
application of a programming sequence. 
The antifuse has a natural advantage 
over fuse-based programming with re- 
spect to testability. Normally open in 
the unprogrammed state, antifuses do 
not short chip circuitry together. 

On the contrary, all elements of the 
chip can be viewed as isolated building 
blocks. By building in special on-chip 
circuitry, all architectural elements can 
be thoroughly tested prior to shipment 
and programming. Much of this circuitry 
is also required for programmability and 
normal chip operation, so that testabi- 
lity overhead is modest. 

This multiplicity of function is especially 
evident for the routing tracks. Routing 
tracks both horizontally and vertically are 
broken into segments of various lengths. 
This makes possible the use of segments of 
appropriate lengths for interconnections. 
Pass transistors are present between 
these segments and can be used to form 
temporary connections between adjacent 
segments. 

This allows the routing tracks to be used 
for carrying programming voltages, ac- 
cessing chip elements for testability pur- 
poses, as well as providing their primary 
function as logic routing resources. The 
pass transistors are not used during normal 
device operation and do not contribute to 
logic-signal propagation delays. The ac- 
companying figure shows a simplified view 
of logic modules, routing tracks and pass 
transistors. 


in the test modes 

The architecture of the ACT 1 arrays al- 
lows outstanding testability of unpro- 
grammed devices. All elements of the chip 


can be thoroughly tested at the factory with 
the exception .of the programmability of 
specific antifuses needed for a particular 
user-defined application. However, these 
can be automatically tested during the pro- 
gramming sequence. Thus, a complete test 
is performed on each device without the 
need for user-generated test vectors. 

Testing begins with the shift-register- 
based addressing scheme. Through the 
shift register, commands can be shifted 
into the device, enabling special modes 
such as programming and functional test- 
ing. In addition, the shift register can ad- 
dress specific antifuses, logic module out- 
puts, and read or drive voltage levels on 
specific routing tracks. The shift register 
can be both up-loaded and down-loaded by 
means of a serial shift clock and a data pin. 

This setup allows various patterns to 
be shifted in and out again, verifying full 
shift-register functionality. With the 
shift register operational, all vertical 
and horizontal routing tracks can be 
tested for continuity and shorts. These 
tests also ensure that all vertical and 
horizontal pass transistors will turn on. 
Further tests on these pass transistors 
are performed to ensure that they have 
satisfactory parametric values. 

The ACT 1 arrays contain a dedicated 
clock buffer that crosses the chip in the 
horizontal routing channels. This buffer can 
be tested by driving the chip's clock pin and 
reading the proper levels on the sides of 
the array via the shift register. 

The arrays have two special pins that can 
be used as user VO channels or, in the test 
mode, as outputs of two on-chip probes 
that can route the output of any logic mod- 
ule to the external pins. This is possible on 
either programmed or unprogrammed 
parts, and selection of probe points does 
not require the programming of antifuses. 
In fact, the probes are reprogrammable and 
can be steered under software control to 
any two points in the chip. 

These probes are also available to the 
user through design-verification software, 
which is a standard part of the Actel design 
system. The probes allow a great many 
tests to be performed that would otherwise 
be impossible. 

For example, all input buffers can be 
tested by driving them low on the outside 
of the chip and probing the internal side of 
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the buffer. Built-in test modes also permit 
driving all output buffers to high, low or 
high-impedance. So it is possible to test 
Vol, Voh, 101, Ioh, Vil, Vih and leakage on 
all VO lines. 

One of the keys to thoroughness with 
which ACT arrays can be tested is the 
module test. Logic modules, in their unpro- 
grammed state, can be viewed as eight- 
input/one-output combinatorial functions. 
By using the horizontal and vertical pass 
transistors, any of the eight module inputs 
can be forced to a high or low. The output 
of the module can then be read through 
either probe pin. In this way, every logic 
module on the chip can be tested with a 100 
percent fault-coverage vector set. No anti- 
fuse programming is required to perform 
these tests. In addition, the architecture 
allows modules to be tested in parallel for 
reduced test time. 


Amifuse ебад 

The consistently high utilization and 
flexibility of ACT 1 FPGAs is due in large 
part to the copious number of routing 


tracks on chip. This, in turn, requires nu- 
merous programming elements. The small 
size of the Plice antifuse makes sufficient 
numbers possible. The ACT 1020 chip, 
with its 2,000-gate equivalents, contains 
186,000 antifuse programming elements. 

Surprisingly, only 2 percent to 3 percent 
need be programmed to fully utilize a chip. 
Nonetheless, antifuse testing must play a 
significant part in the overall device test 
methodology. 

The first antifuse test is the blank test. 
The entire array of antifuses is easily test- 
ed to ensure that antifuses are in their 
normally open state. The next and most 
important antifuse test is the accelerated 
life test. Dielectric breakdown in Plice anti- 
fuses is a strong function of voltage, and 
this effect can be used to perform an effec- 
tive life-test acceleration screen in place of 
the more commonly used temperature-ac- 
celeration approach. 

In addition, the convenience of a voltage- 
acceleration factor facilitates the perfor- 
mance of a life-test screen on every chip 
and its antifuses as part of the normal test 


sequence. The acceleration voltage is ap- 
plied through the programming voltage pin 
(VPP). After this test, the blank test is 
again run to detect any antifuses that have 
failed. 

This test is very effective at catching 
antifuse defects. Using devices screened in 
this way, more than 700,000 device-hours 
have been completed in 125? C dynamic 
burn-in reliability testing with no observed 
antifuse failures. 

Several tests are used to confirm that 
programming circuitry works correctly. 
The first such test is a basic junction stress 
and leakage test. The programming mode 
is enabled, and the programming voltage 
(VPP) plus a guard band is applied to the 
chip. No voltage is applied to the antifuses. 
If programming current (IPP) exceeds a 
normal value, the device is rejected. The 
functionality of the programming circuitry 
can be further verified by programming 
various antifuses associated with special 
test structures and other non-user accessi- 
ble features. 

A number of logic modules exclusively 


available for test purposes are connected to 
each other by programming antifuses. This 
structure also can be used for AC-perfor- 
mance characterization of the chip. In addi- 
tion, a number of antifuses are configured 
as a readable signature word. Bits in this 
word are programmed with factory-defined 
patterns. Successful completion of these 
tests ensures basic functionality of on-chip 
programming circuitry. 

The description of the numerous ACT 
1010 and 1020 test modes attest to the 
outstanding testability of these devices. As 
a result, statistical sampling currently 
shows a programming yield of better than 
98 percent. 

Even more important, for all sampled 
parts that successfully passed program- 
ming, no functional failures have been ob- 
served. 

These tests were conducted with de- 
signs that utilized 92 percent to 97 percent 
of the logic modules. Although this utiliza- 
tion is high, it is not infrequent among 
users of Actel arrays. 
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FouR BASIC APPLICATION CIRCUITS ACCURATELY 
PREDICT IF An ASIC DESIGN CAN BE IMPLEMENTED 


COMPARE ASIC CAPACITIES 

















DESIGN APPLICATIONS 


IN A SPECIFIC GATE ARRAY 


WITH GATE ARRAY 
BENCHMARKS 


hen evaluating a gate array’s capacity, designers 
typically concentrate on only the circuit they are 
working on. The trouble here is that this evalua- 
tion says nothing about whether the gate array 
will implement the designer’s next circuit, or the 
one after that. 

A functional method of testing gate array capacities must consider the char- 
acteristics of the different circuits being implemented. ASIC designers must 
know if a circuit can fit into a certain gate array. The solution: benchmarking, 
which takes routability and application differences into consideration. Toward 
this end, Actel has developed a set of application circuit benchmarks that give 
design engineers accurate predictions of ASIC capacities. 

ASIC capacities are currently measured in terms of equivalent gates. An 
equivalent gate is a two-input NAND gate made up of an array of unconnected 
transistors. For example, manufacturers of CMOS gate arrays calculate the 
amount of equivalent gates simply by counting the number of transistors and 
dividing by four—the typical number of CMOS transistors per gate. This ap- 
proach to calculating gate-array capacities is fairly reliable and consistent with- 
in a limited class of devices, but it doesn’t relate to a meaningful standard that 








1. A MEASURE FOR a benchmark circuit unit is taken by mapping the unit into a gate 
array as often as possible in a step-and-repeat procedure. 


BOB OSANN AND ABBAS EL GAMAL 
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To other benchmark units 


From other! benchmark units 


benchmark unit 
8 





BS; 2. 248 GATES ARE needed to implement the timer-counter benchmark circuit. This timer is typical of microprocessor-based systems. 


applies across all ASIC types. 

A better, more accurate measure 
of ASIC capacity isa benchmark that 
considers application differences. 
Considering the broad variation of 
applications, creating these bench- 
marks isn’t a trivial task. For exam- 
ple, some applications contain a 
great deal of combinatorial logie 
while others are mainly sequential. 
These inconsistencies mean that cer- 
tain application circuits map into dif- 
ferent ASIC gate-array types better 
than others. 

The key to making efficient use of 
an ASIC’s available gates is routabi- 
lity. Because all circuits aren’t equal- 
ly routable, numerous benchmark 
circuits are needed. Another factor 
in routability is the ratio of combina- 
torial to sequential logic—an impor- 
tant consideration for an ASIC that 
comes with a fixed number of flip- 
flop resources. 


PIN-PER-GATE COUNTS 

One way to roughly evaluate a cir- 
cuit’s routability is to consider how 
many macro pins-per-gate each mac- 
ro function in the circuit employs. A 
“macro pin” refers to the input and 
output connection points on a macro- 
logic function—it has no relation to 
an I/O pin on the device package. Se- 





quential logic tends to use many 
gates and relatively few pins per 
gate, while combinatorial logic uses 
a relatively high number of pins per 
gate approaching the maximum of 
three (two-input NAND). 

In general, the greater number of 
pins per gate, the more difficult the 
circuit is to route. However, the pin- 
per-gate routability measure is limit- 
ed: An average pin-per-gate figure 
doesn’t convey the local congestion 
that can arise in certain types of ap- 
plications, putting excessive strain 
onan ASIC’s routing resources. 

A useful evaluation is to actually 
route the various benchmark circuits 
into gate arrays. This approach is es- 
pecially effective when applied to 
ASIC architectures with routing re- 
sources less abundant than those of 
channeled gate arrays. Specifically, 
the benchmark circuits must offer 
divergent pin-per-gate ratings to 
give a valid picture of routability for 
different applications. 

Because most real-world applica- 
tions include a variety of circuit 
types with different routability lev- 
els, filling a chip with one type of cir- 
cuit might seem artificial. This is 
why there are four different bench- 
mark circuits each mapped into one 
chip. But even one application bench- 
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mark circuit alone gives a realistic 
measurement as most application 
circuits tend to be bimodal—their se- 
quential logic and combinatorial log- 
ic generally have very different pin- 
per-gate measurements. The combi- 
natorial logic sections are commonly 
routing-limited, while the sequential 
logic sections are not. The different 
benchmark circuits measure the 
ASIC’s ability to deal with each type 
of logic. 

An ideal benchmark circuit unit 
should be small enough to fit into an 
ASIC many times without leaving a 
sizable amount of unused logic. On 
the other hand, the units must be 
large enough to realistically create 
the routability challenges of actual 
application circuits. Also, the circuits 
should be designed such that no I/O 
limitations are encountered, allow- 
ing the benchmark focus to remain 
solely on logic capacity. 

Any useful benchmark circuit 
must relate to some type of stan- 
dard. The standard against which 
Actel measures its units is an LSI 
Logic gate array. Among the most 
widely used gate arrays in the indus- 
try, LSI Logic parts have become de 
facto industry standards. With cir- 
cuits constructed from LSI Logic 
macros, capacity requirements in 
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terms of two-input NAND equiva- 
lents can be quickly computed in a 
clear way. 

To take application variability into 
account, benchmark circuits must be 
designed to represent all the differ- 
ent types of application circuits. The 
CALC (comparative ASIC logic ca- 
pacity) benchmark developed by Ac- 
telis actually a set of four circuits, in- 
cluding a data path, timer-counter, 
State machine, and an arithmetic 
unit. The CALC benchmarks are an 
attempt at a broader, more disci- 
plined determination of an ASIC's 
capacity than an equivalent gate 
reading. 


STEP-AND-REPEAT 

Each benchmark circuit is used to 
create a measure by mapping it into 
an ASIC as many times as possible in 
a step-and-repeat procedure (Fig. 1). 
The results demonstrate how wellan 
ASIC accommodates the type of cir- 
cuit represented by the benchmark. 
The number of successful iterations 
supply a relative measure of device 
capacity when the same benchmark 
circuit and procedure is applied to 
several devices. 

The first CALC benchmark is the 
data-path circuit unit. Most board- 
level designs—even those that heavi- 
ly utilize programmable logic de- 
vices—use a significant number of 
data path devices. The purpose of the 


data-path circuit benchmark is to | 


steersignals from one point to anoth- 
er, and to occasionally store the sig- 
nals. Common TTL datapath func- 
tions include octal registers and 
latches, buffers, transceivers, and 
multiplexers. Shift registers used to 
perform parallel-to-serial and serial- 
to-parallel conversions are also con- 
sidered a form of data-path circuit 
function. 

This data-path benchmark circuit 
employs a 4-to-1 multiplexer, an 8-bit 
register, and an 8-bit shift register. 
The octal 4-to-1 multiplexer—similar 
to an LS153 multiplexer—selects 8 
bits from a 32-bit input bus. This 
function approximates the many dy- 
namic-RAM-based systems in which 
а 4-60-1 multiplexer selects between 
the processor address and the re- 
fresh address. Eight of the 32 data 











bits entering the multiplexer are 
kept separate in order to allow data 
to be received from a preceding 
benchmark unit when placing multi- 
ple, daisy-chained circuit units into a 
device. 

The outputs of the multiplexer 
feed a continuously-clocked, 8-bit 
holding register. The register then 
feeds an & bit parallel-in/serial-out 
shift register, similar to the popular 
LS166 register used in many video 
designs. 

The benchmark shift register also 
has a parallel 8-bit output used for 
the daisy-chain connection to the 
next benchmark unit. The shift reg- 
ister's operation is controlled by its 
Shift/Load signal, which connects to 
the Clock Enable inputs of the regis- 
ter's flip-flops. 

The pin-per-gate distribution indi- 






From other benchmark units 
1 


Reset Clock Data 









To other 
benchmark units 


3. THIS THIRD circuit, an average- 


sized controller/sequencer, contains a 
large amount of non-standard logic. The 
outputs of this state machine are 
determined by its state diagram (see the 
insert). 





BENCHMARK CIRCUITS IMPLEMENTED 





cates many sequential macros with 
0.5 pins per gate, and on the other 
end of the scale, 4:1 multiplexers 
with the maximum of three pins per 
gate. When implemented in an LSI 
Logic LL7220 gate array, each data- 
path benchmark circuit needs about 
157 gates. 

The second benchmark circuit is 
the timer-counter (Fig. 2). This par- 
ticular circuit is typical of the many 
timer functions found in micropro- 
cessor-based systems. While dedi- 
cated timer-counter chips such as the 
8253 are popular, they are too gener- 
al for many ASIC applications and 
would require too many gates. A bet- 
ter solution is to build a specific tim- 
er-counter function. 

The multiplexer used in this circuit 
is similar to an LS157 multiplexer 
and allows the LS161-type counter to 
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From other benchmark units 
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Reset Clock Data 
To other benchmark units 


DESIGN APPLICATIONS 


ASIC 
BENCHMARKS 





4, THE ARITHMETIC CIRCUIT, a basic multiplier/accumulator, employs the 






be loaded in two different ways: di- 
rectly from incoming data or from 
the preset value register. The de- 
sired preset data is loaded into the 
counter when the “equal-to” com- 
parator indicates a match between 
the counter’s state and the value ina 
compare register. 

Both the preset value and the com- 
pare value registers are similar to 
LS377 octal registers with Clock-En- 
ables except that these also have an 
asynchronous reset function. The 
timer-counter’s basic sequence of op- 
erations (count, compare, load) is re- 


5. A COMPARISON OF pin 
density distributions is a good measure of 
routability. High pins-per-gate circuits 
are generally difficult to route. 


highest number of gates out of all the benchmarks—295. 













peated endlessly. 

The timer-counter circuit employs 
an intense amount of registers and 
exclusive-Or (XOR) gates. Registers 
hold the preset and compare values 
as well as implementing operation of 
the counter. The XOR gates, which 
are low pin-per-gate functions, are 
used in the counter and compare 
functions. 

In the pin-per-gate distribution, 
the sequential functions have less 
than one pin per gate, and the simple 
gates have about 2.5 pins per gate. 
The timer-counter benchmark circuit 
requires about 248 gates to imple- 
ment in an LL7220 gate array. 

Benchmark number three is the 
state machine circuit representing 
an average-sized controller-sequenc- 
er that’s found in many PLD applica- 
tions (Fig. 3). Because of the large 
amount of nonstandard logic in- 
volved, this is an important applica- 
tion for an ASIC. E 

This third benchmark circuit is a 
Mealy machine, whose outputs are 
determined from the total state of 
the system. The outputs are regis- 
tered and clocked to produce syn- 
chronous output data. The machine 
has eight states and goes through 12 
transitions. 

The circuit's distinctive feature as 
a benchmark is its use of many wide 
decode stages (12 bits wide) in the 
combinatorial logic section, which 
demands a relatively large number 
of pins per gate. Consequently, this 
benchmark tests for how well an 
ASIC can handle highly congested 














circuits with high fan-in on each mac- 
ro. Because they generally have a 
limited number of internal pins per 
module, the circuit poses a challenge 
to module-based ASICs. 

Many LS273-type registers are in- 
cluded on this benchmark circuit. 
The combinatorial portion of a typi- 
cal state-machine circuit has an ex- 
tremely high pin-per-gate rating. 
The state-machine benchmark uti- 
lizes about 153 gates in an LL7220 
gate array. 

The final benchmark is the arith- 
metic circuit, more specifically a 
multiplier/accumulator (MAC) (Fig. 
4). This circuit consists of a multipli- 
er, an 8-bit adder, and an 8-bit regis- 
ter. Such arithmetic circuits are em- 
ployed in FIR filters and other DSP- 
type applications. 

A 4-by-4-bit multiplier accepts ei- 
ther outside data or data from a pre- 
ceding benchmark circuit unit. The 8- 
bit multiplication results are added 
to previous accumulator results, and 
then the current 8-bit accumulator 
result is moved to the output regis- 
ter. The 1-bit MAC input controls a 
gating function on the adder’s input. 

The MAC circuit represents a com- 
bination of circuit types that test an 
ASIC’s ability to handle logic with 
both high and low pin-per-gate 
counts. The adders, as combinatorial 
functions, are high pin-per-gate sec- 
tions. However, the registers used in 
both the multiplier and the output 
register are low pin-per-gate func- 
tions. 

The largest pin-per-gate frequen- 
cy (three pins per gate) generally oc- 
curs with this type of benchmark cir- 
cuit, owing to the adder tree used to 
make up the array-multiplier (Fig. 
5). The sequential content of this 
benchmark has less than one pin per 
gate, as expected. Although one of 
the simplest benchmark circuits 
from a conceptual point of view, the 
arithmetic benchmark requires the 
most gates in the LL7220 gate ar- 
ray—295. 


STANDARD RESULTS 

The CALC benchmark results for 
LSI Logic gate arrays can be com- 
pared to the results for other ASICs 
supplying a reliable standard. The 
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LL7220, one of the industry’s most 
successful arrays, is commonly re- 
ferred to as a 2000-gate device— 
though it actually contains 2224 2-in- 
put NAND gate equivalents. With 
an 85% usability rating, the LL7220 
supplies 1890 usable gates. 

Actel’s ACT1020 desktop-confi- 
gurable, channeled gate array 
scores about the same as the LSI 
Logic LL7220 channeled gate array 
for all but one of the benchmarks 
(see the table). Therefore, even 
though the ACT1020 gate array im- 
plements an application circuit in 
logic modules instead of intercon- 
nected transistors, the device can be 
thought of as a 2000-gate gate array. 

These results were produced by 
first stepping and repeating the 
benchmark circuit, then using an 
automatic router to map the circuit 
into the device. The use of autorout- 
ing is critical because although it’s 
always possible to achieve better re- 
sults with a manual place-and-route 
procedure, few designers would pre- 
fer a manual procedure for an appli- 
cation circuit. Consequently, these 
circuit benchmarks will produce typ- 
ical outcomes only when autorouting 
is used. 
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MASSACHUSETTS Carlson Electronic Sales Associates ................ (414) 476-2790 

CompRep Аѕѕосіаівз............................ (617) 329-3454 CANADA , 

MICHIGAN Clark Hurman Associates (Ontario Brampton) |... (416) 840.6006 

MINNESOTA. ב ל‎ (313) :227:9898 Clark-Hurman Associates (Ontario-Nepean) .........- (613) 727-5626 

Gibb Technology Sales .......................... (612) 835-3370 

WYLE LABORATORIES 

ARIZONA OREGON 

РРО URS Ie Son E (602) 437-2088 Beaverton: ce ec mex nai eee cote al (503) 643-7900 

CALIFORNIA TEXAS 

Calabasas iis. berth has e Een ua (818) 880-9000 Austin: nn ae cb ed за DENDAN Ка. (512) 345-8853 

7 (714) 863-9953 Hoüston « оо оа ame s збы” me Berne (713) 879-9953 

Rancho Cordova. . . . (916) 638-5282 Richardson... ב‎ neta tees Read es ee (214) 235-9953 

San Diego . . (619) 565-9171 UTAH 

Santa Clara (408) 727-2500 ית‎ )801( 3 

COLORADO WASHINGTON 

MASSACHUSETIS LLL (303) 457-9953 Redmond- ul Su ae Dues aad en re he (206) 881-1150 


Bürlingtoni....... ck ee aa ns (617) 272-7300 


DOMESTIC DISTRIBUTORS (continued) 


PIONEER STANDARD ELECTRONICS 








NEW JERSEY 

Fairlield EPEN noL mro Aser eR ee N (201) 575-3510 
NEW YORK 

Binghamton... ..... г.а еган ERR (607) 722-9300 
атро: aus ean RN ent fe (716) 381-7070 
Woodbüry:: שיב גל בש‎ АУ (516) 921-8700 
OHIO 

Cleveland... act ren ER CE IA (216) 587-3600 
Dayton: о i een IRE EAER NEARE (513) 236-9900 
PENNSYLVANIA 
לביטספו‎ )412( 0 


)512( 0 
(214) 386-7300 
(713) 495-4700 





WISCONSIN 

Brookfield. о De Y. CM ed d (414) 784-3480 
MARYLAND 

Galihersbürg: serora a AA ae (301) 921-0660 
NORTH CAROLINA 

Сһапое:. 6-5 בה‎ e МЫ ere EETITMNT (704) 527-8188 
отат ur en EE TEE ET ES (919) 544-5400 
PENNSYLVANIA 

Horsham y ааыр ee amt a (215) 674-4000 
ONTARIO 

Mississauga ................................... (416) 564-9600 
Népeari-. ur ar ae d (613) 226-8840 
QUEBEC 

737-9700 )514( "בר ו 
Stan Ek: esee e Suyu osa ES Mt STR REN (418) 654-1077‏ 


INTERNATIONAL DISTRIBUTORS 


CONNECTICUT 

Norwalk: cca xine. о ан а E EON (203) 853-1515 
ILLINOIS 

Addison... ופ בש אוב ב‎ asia es vas (708) 495-9680 
INDIANA 

Indianapolis... (317) 573-0880 
KANSAS 

Call the Minnesota Office ......................... (612) 944-3355 
MASSACHUSETTS 

Eexingtori: a. ex e Sub eae e oc eu ated (617) 861-9200 
MICHIGAN 

ОӘгапа Варіаѕ .................................. (616) 698-1800 
ו‎ а eus Жак (313) 525-1800 
МІММЕЅОТА 

Eden Prairie... oss LET inne (612) 944-3355 
MISSOURI 

Call the Minnesota ОШсе......................... (612) 944-3355 
PIONEER TECHNOLOGIES 

ALABAMA 

Huntsvilles 2.00 cae es ewe ey ra (205) 837-9300 
CALIFORNIA 

San 0088 vv ope рде eee YL Pee a tnd КУУ (408) 954-9100 
FLORIDA 

Altamonte Springs (407) 834-9090 
Clearwater........ 2d PR ...(813) 536-0445 
Deerfield Ввасћ. ................................ (305) 428-8877 
GEORGIA 

ШЕШ 85. corneas pa Жа яле из» н». Кы rer u (404) 623-1003 
ZENTRONICS (Canada) 

ALBERTA 

Calgary. «meatu apt eh EN e ee neus (403) 295-8838 
Edmonton = «oves rrr CD EVE eie wer (403) 484-1669 
BRITISH COLUMBIA 

Richmond: а sata URDU NIE RP E OU (604) 273-5575 
MANITOBA 

Winnipeg > Жуу э 22 as dee Жиз לפ‎ RR IUE (204) 694-1957 
AUSTRALIA 


tO’ 953.9844 


Distributed by: 


MICROPROCESSOR 
& MEMORY 
DISTRIBUTION LTD. 
3 Bennet Court, 


Bennet Road, Reading, 
Berkshire RG2 ООХ. 


Telephone: (0734) 313232 
Fax: (0734) 313255 Telex: 846669 


THE PROFESSIONAL TECHNICA 
L 
FOR ALL ASIC REQUIREMENTS СЕ 


mu 
Benchmark Systems (Madras) ....................... (44) 413.866 








ISRAEL 
AS.T. Ltd. (Нега). араара (52) 58.33.55 
за SpA. (Milan) ..................... (2) 66.10.1370 
oration (Yokohama-Shi) .,.............. (45) 474.9037 
ıctronics Corporation (Kyoto) ............ (75) 951.8151 
onies, Inc. (Ѕеош) ..................... (2) 553.2997 
ANDS 
(Enschede)... yy k D ee (53) 33.03.36 
ynics A/S (Hvalstad)...................... (2) 84.50.70 
хеѕ S.A. (Barcelona) ................... (3) 217.23.40 
ores S.A. (Madrid) ...................... (1) 742.2313 
fSla) uy erecto aes aer E (8) 93.00.00 
LAND 
Begg) о ле оа (32) 53.63.75 
Corporation (Taipei) .................... (2) 521.1100 


8/91.2 


Actel Corporation 

955 East Arques Avenue 

$ unnyvale, CA 94086 
=” 408.739.4010 


. Technical Hotline 800.262.1060 “` 5172020-1 ' 


` 
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